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Title; Expression of alphaHiHcroglobullns 

FIELD OF THE INVENTION 

The present invention relates to the expression of ft-macroglobu- 
5 Tins, derivatives and variants thereof, and especially the expression of the 
human cta-macroglobulin (a^M) in an active form in mammalian cells, and the 
expression of genetically engineered variants thereof. The use of such 
recombinant a-macroglobul ins, especially recombinant {^(ragM) and variants 
is described with examples from the fields of medicine for therapeutic 
10 purposes, and the development of novel defined growth media for propagation 
of mammalian cells in culture. 

BACKGROUND OF THE INVENTION. 

BIOCHEMISTRY OF tto-MACROGLOBULIN (afl). 

15 The proteinase binding glycoprotein o^M, which is synthesized in 

the liver, constitute together with the complement proteins C3, C4 and C5 a 
separate class of structurally and functionally related large plasma 
proteins. For a recent review see (Sottrup- Jensen, L. (1987) in: The Plasma 
Proteins (Putnam, F.W., ed.) 2nd Ed., 5: 191-291, Academic Press, Orlando, 

20 FL). 

Apart from C5 these proteins contain an internal B-cysteinyl- 
7-glutamyl thiol ester, which enables the proteolytically activated forms 
of ctgM, C3, and C4 to participate in characteristic covalent binding reactions 
(Sottrup- Jensen, L., et a!., (1980) FEBS Lett. 121: 275-280; Salvesen, G.S. 

25 and Barrett, A.J., (1981) Biochem. J. 18Z: 695-701). The thiol ester 
structure, which in the active proteins can be slowly cleaved by a number of 
small nitrogen nucleophiles, constitutes a unique type of postsynthetic 
modification of proteins, and plays a prominent role in the biological 
properties of a^M. The presence of the active thiol esters in o^M is revealed 

30 by a characteristic pattern of heat fragmentation (Harpel, P.C., et al., 
(1979) J. Biol. Chem. £54: 8859-8878). 

Traditionally, ajft has been studied within the context of plasma 
proteinase inhibitors, although by several criteria it is unique. Whereas 
most plasma proteinase inhibitors are monomeric proteins of roughly similar 

35 size, containing approximately 430-500 residues, is a tetramer whose 180- 
kD subunits contain 1451 residues (Sottrup-Jensen et al., (1984) J. Biol. 
Chem. 259: 8318-8327). 

Furthermore, in contrast to most other proteinase inhibitors, 
which form 1:1 complexes with serine proteinases engaging the active site 
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of the proteinase and the reactive site of the inhibitor, o^M forms complexes 
with a broad spectrum of proteinases differing in their substrate specifi- 
city and catalytic mechanism e.g.: trypsin, leucocyte elastase, chymotrypsin, 
pancreatic elastase, cathepsin G, plasmin, plasma kallikrein and thrombin. 

5 The second-order rate constant for association between these 

proteinases and varies by several orders of magnitude. Both 1:1 and 2:1 
proteinase-a^ complexes can be formed, and the disul fide-bridged dimer (360 
kD) appears to be the functional unit of afl (Sottrup- Jensen, L. (1987) in: 
The Plasma Proteins (Putnam, F.W., ed.) 2nd Ed., 5: 191-291, Academic Press, 

lOOrlando, FL). Contrary to "classical" proteinase inhibitor complexes the otjM 
bound proteinase is still active, especially toward small synthetic 
substrates (Sottrup- Jensen, L. (1987) in: "The Plasma Proteins" (Putnam, 
F.W., ed.) 2nd Ed., 5: 191-291, Academic Press, Orlando, FL). 

The mechanism of proteinase binding by OjM has been described by 

15 the "trap" (Barrett, A.J. and Starkey, P.M. (1973) Biochem. J. 133: 709- 
724), where proteolytic cleavage of a particularly exposed peptide stretch 
near the middle of the 180-kD subunit (the "bait" region) results in a 
conformational change of the afl tetramer, thereby entrapping the proteina- 
se. The nature of the essentially irreversible proteinase complex formation 

20with ojl has long remained elusive. However, recent investigations show that 
a major fraction (typically > 80-90 % of the trapped proteinase is also cova- 
lently bound through epsilon-lysyl (proteinase) -7-glutamyl (o^M) bonds (Sottrup- 
Jensen, L. et al., (1981) FEBS Lett. 128: 127-132; Sand, O. et al., (1985) 
J. Biol. Chem. 260: 15723-15735; Pochon, F. et al., (1987) FEBS Lett. 217: 

25101-105). 

PHYSIOLOGICAL ASPECTS OF PROTEINASE-ail INTERACTIONS. 

Since the OjM-proteinase complexes are rapidly cleared from the 
circulation (Ohlsson, K. (1971) Acta Physiol. Scand. 81: 269-272; Imber, 
30 M.J. and Pizzo, S.V. (1981) J. Biol. Chem. £§6: 8134-8139.) a general role 
as a "clearing vehicle" for plasma proteinases has been envisaged. 

The main physiological targets may include proteinases of the 
coagulation and fibrinolysis systems and plasma kallikrein, and perhaps also 
proteinases like leucocyte elastase, cathepsin G and collagenases and other 
35 proteinases released during cellular turnover (Sottrup- Jensen, L. and 
Birkedal -Hansen, H. (1989) J. Biol. Chem. 264: 393-401). 

Although OjM may be largely confined to the vasculature in healthy 
uninflamed tissues, the inhibitor and its proteinase complexes are found at 
near plasma levels in inflammatory exudates of rheumatoid joints and gingival 
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crevicular fluids (Tollefsen, T. and Saltved, E. (1980) J. Periodont. Res. 
15: 96-106; Borth, W., et al., (1983) Ann. N. Y. Acad. Sc1. 421: 377-381). 

While plasma o^M appear to be synthesized in the liver (Schreiber, 
G. (1987) in: "The Plasma Proteins" (Putnam, F.W., ed) 2nd Ed., 5_: 294-363, 
5 Academic Press, Orlando, FL.) other sites of synthesis exist. Several cell 
strains in culture have been shown to produce aj\ including fibroblasts 
(Mosher, D.F., et al., (1977) J. Clin. Invest. 60: 1036-1045) and monocytes- 
/macrophages (Hovi, T., et al., (1977) J. Exp. Med. H5.: 1580-1589). 

Whereas hepatocytes and Kupffer cells of the liver are most 

10 important for clearance of a^M-proteinase complexes in plasma (Davidsen, 0., 
et al., (1985) Biochim. Biophys. Acta 846: 85-92), fibroblasts (Van Leuven, 
F., et al., (1979) J. Biol. Chem. 251: 5155-5160; Mosher, D.F. and Vaheri, 
A. (1980) Biochim. Biophys. Acta 627: 113-122) and macrophages (Debanne, 
M.T., et al., (1975) Biochim. Biophys. Acta 411: 295-304; Kaplan, J. and 

15 Nielsen, M.L. (1979) J. Biol. Chem. 254: 7323-7328) also possess receptors 
for o^M- proteinase complexes. 

These observations suggest that there may be a considerable 
extravascul ar turnover of o^M perhaps primarily carrying proteinases 
functioning in the cellular micro environment (Sottrup- Jensen, L. and 

20 Birkedal -Hansen, H. (1989) J. Biol. Chem. .264: 393-401). 

SUMMARY OF THE INVENTION 

Briefly stated, the present invention discloses a method for the 
production of recombinant o-macroglobulins, and especially human OgM, and 
25 variants thereof in an active form. 

Within a preferred embodiment, the cultured host cell is an 
eukaryotic cell such as a mammalian cell or cells derived from organisms 
such as insects, plants, yeast or other fungi, such as Aspergillus . 

The invention further relates to DNA sequences comprising a gene 
30 encoding for the expression of human o^M and variants thereof, vectors 
comprising such DNA sequences, and suitable hosts transformed with such 
vectors . 

Yet another aspect of the invention is the use of recombinant 
o«M and variants thereof as a protein carrier in enzyme replacement therapy 
35(ERT). 

Yet another aspect of the invention is the use of recombinant 
ofeM and variants thereof as a DNA carrier in gene therapy. 

Further aspects of the invention relates to the use of recom- 
binant o-macroglobulins, especially human ajt, and variants thereof as 
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constituents of growth media, either as an additive or co-expressed with a 
desired gene product. 

DEFINITIONS 

5 Prior to setting forth the invention it may be helpful for an 

understanding thereof to set forth definitions of certain terms to be used 
hereafter. 

Complementary DNA or cDNA: A DNA molecule or sequence which have been 
lOenzymatically synthesized from sequences present in a mRNA template. 

DNA Construct: A DNA molecule, or a clone of such a molecule, either single- 
or double- stranded, which may be isolated in partial form from a naturally 
occurring gene or which has been modified to contain segments of DNA which 
15 are combined and juxtaposed in a manner which would not otherwise exist in 
nature. 

Plasmid or Vector: A DNA construct containing genetic information which may 
provide for its replication when inserted into a host cell. A plasmid 
20 generally contains at least one gene sequence to be expressed in the host 
cell, as well as sequences encoding functions which facilitate such gene 
expression, including promoters and transcription initiation sites. It may 
be a linear or closed circular molecule. 

25 Joined: DNA sequences are said to be joined when the 5' and 3' ends of one 
sequence are attached by phosphodi ester bonds to the 3' and 5' ends, 
respectively, of an adjacent sequence. Joining may be achieved by such 
methods as ligation of blunt or cohesive termini, by synthesis of joined 
sequences through cDNA cloning, or by removal of intervening sequences 

30 through a process of directed mutagenesis. 

Variant: A peptide related to the original peptide, but wherein the amino 
acid sequence has been altered through mutation of the gene encoding the 
original peptide. 

35 
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ABBREVIATIONS 
AMINO ACIDS 



A 




Ala 


Al anine 


y 


s 


Val 


Valine 


5L 


- 


Leu 


Leucine 


j 




He 


Isoleucine 


p 




Pro 


Prol ine 


F 




Phe 


Phenyl al anine 


W 


_ 


Trp = 


Tryptophan 


10 M 


as 


Met 


Methionine 


G 




Gly 


Glycine 


s 




Ser = 


Serine 


T 




Thr 


Threonine 


c 




Cys 


Cysteine 


15 Y 




Tvr = 


Tyrosine 


N 


= 


Asn 


Asparagine 


Q 


a 


Gin 


Glut amine 


D 


B 


Asp 


Asoartic Acid 


E 


S 


Glu 


Glutamic Acid 


20 K 




Lys 


Lysine 


R 




Arg 


Arginine 


H 




His 


Histidine 


NUCLEIC ACID BASES 




25 A 




Adenine 




G 




Guanine 




C 


B 


Cytosine 




T 


B 


Thymine (only in DNA) 


U 




Uracil (only in RNA) 



30 



BRIEF DESCRIPTION OF THE DRAWINGS 

Figure la illustrates the construction of plasmid p!136. 

Figure lb illustrates the construction of plasmid pl!67. 
35 Figure 2 illustrates the structure of plasmid pi 167. 

Figure 3 illustrates a gel electrophoresis (10 - 20 % SDS-PAGE) 
of the thermal fragmentation products generated from and ro^M. 

Figure 4 illustrates a gel electrophoresis of the thermal 
fragmentation products generated from methyl amine treated OjM and rOgM. 
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Figure 5 illustrates a gel electrophoresis (SDS-PAGE) of the 
reaction products generated from trypsin treatment of otjM and rajl. 

Figure 6 illustrates a gel electrophoresis of the reaction produ- 
cts generated from trypsin treatment of methyl amine- treated o^M and rOjM. 
5 Figure 7 illustrates a "rate gel" electrophoresis of unreacted 

native -and trypsin treated OjM and rOjM. 

Figure 8 illustrates a "rate gel" electrophoresis of unreacted 

native -and methyl amine treated a& and ro^M. 

Figure 9 illustrates the chromatograms of otjM and ro^H on a 

lOSuperose 6 column. 

Figure 10 illustrates the gel electrophoresis (10 - 20 % reducing 
SDS-PAGE) of the reaction products from chymotrypsin treated human OjM, human 

PZP and rofcM-PZP. 

Figure 11 illustrates the gel electrophoresis (10 - 20 % reducing 
15 SDS-PAGE) of the reaction products from elastase treated human otjM, human 

PZP and rajM-PZP. 

Figure 12 illustrates the gel electrophoresis (10 - 20 % reducing 
SDS-PAGE) of the reaction products from trypsin treated human a^M, human PZP 
and rcfeM-PZP. 

20 Figure 13 illustrates the gel electrophoresis (10 - 20 % reducing 

SDS-PAGE) of the reaction products from Staphylococcus aureus Glu-specific 
protease treated human otjM, human PZP and rotjM-PZP. 

25 DETAILED DESCRIPTION OF THE INVENTION 

According to the invention there is provided a process for the 
production of a-macroglobulins, especially human ofe-macroglobulin, or 
fragments or derivatives, including variants thereof, wherein a functionally 
operative expression vector comprising a gene encoding for the expression of 

30a a-macroglobulin, especially human o^-macroglobulin, or fragments or 
derivatives thereof, including variants, or alleles of such a gene, is intro- 
duced into a suitable host capable of expressing said gene, said host is 
cultured in a suitable nutrient medium containing sources of assimilable 
carbon and nitrogen and other essential nutrients, and the expressed at- 

35macroglobulin, especially human otj-macroglobulin, or fragments or derivatives 
thereof is recovered. 

Many proteins synthesized particularly in mammalian cells undergo 
post-translational modification (processing) of one kind or the. other. 
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Depending on the final destination and on the specific function of a newly 
synthesized protein, it may go through a number of processing steps leading 
to covalent modifications such as e.g.: glycosylation, Y-carboxylation, B- 
hydroxylation, sulphatation, amidation, thiol ester formation, phosphory- 
Blation, proteolytic cleavage at precursor processing sites, fatty acylation 
(Rosner, M.R. (1986). in: "Mammalian Cell Technology 0 , (Thilly, W.G. ed), 
Butterworth Publishers, Stoneham, MA. : 63-89). 

Proteins of various sizes and with a variety of different post- 
-translational modifications have been successfully expressed in transformed 

10 heterologous mammalian host cells using recombinant DNA technology. A few 
examples: Human coagulation factors Vila and IX have been expressed in trans- 
formed BHK (Syrian Baby Hamster Kidney) cells with correct post-translational 
modifications such as -y-carboxylation and glycosylation (Thim, L. et al., 
(1988) Biochemistry 27: 7785-7793; Busby, S. et al., (1985) Nature 316: 271- 

15 273). Human Platelet-derived Growth Factor AB heterodimer has been expres- 
sed in transformed CHO (Chinese Hamster Ovary) cells with correct processing 
of the A and B chain precursors and correct assembly of the AB heterodimer. 
Human coagulation factor VIII has been expressed in transformed CHO cells 
with correct processing of the precursor leading to a two chain molecule that 

20 can be activated by thrombin and factor Xa (Kaufman, R.J. et al., (1988) J. 
Biol. Chem. 263: 6352-6362; Pittman, D.D. and Kaufman, R.J. (1988) Proc. 
Natl. Acad. Sci. USA 85: 2429-2433). 

So far, there have been no reports on the heterologous expression 
of proteins in which the formation of an active thiol ester is a prominent 

25 post-translational modification. 

The biosynthesis of the internal thiol ester in the third com- 
ponent (C3) of complement from rabbit has been investigated (Iijima, M. et 
al., (1984) J. Biochem. 96: 1539-1546). Rabbit liver mRNA was translated in 
vitro in a rabbit reticulocyte lysate system, and the synthesized C3 specific 

30 products did not incorporate radio labelled methylamine. On the other hand 
radio labelled iodoacetamide reacted with the synthesized C3 specific 
products; these results indicated the presence in the primary C3 specific 
translation product of a free thiol group instead of a reactive thiol ester. 
If a liver homogenate supernatant (S-13) including cytosol and microsomes was 

35 included, the C3 specific product could now incorporate methylamine. By 
increasing the concentration of the S-13 component(s), the incorporation of 
methylamine in C3 specific products was increased, and at the same time 
incorporation of iodoacetamide decreased. If the S-13 fraction was treated 
at 65°C for 5 min, the activity was completely lost. 
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The results from this investigation strongly suggest an involve- 
ment of a transglutarainase-like or other type, of enzyme in the posttransla- 
tional formation of an active thiol ester 1n rabbit C3. There are no similar 
investigations addressing the formation of the thiol ester in other a-macro- 
5 globulins, e.g. OjM, but from analogy and homology considerations, it is 
expected that a similar mechanism is responsible for the formation of thiol 
esters in other er-macroglobulins synthesized in the mammalian liver. 

Through this investigation a number of developments were done 
10 which also are deemed to be encompassed of the present invention. These 
include DNA sequences comprising a gene encoding for the expression of a- 
macroglobulins, especially human Ofe-macroglobulin, or fragments or deriva- 
tives and variants thereof as exemplified. in SEQ ID N0:1 and SEQ ID N0:3. 

Another aspect of the invention relates to functionally operative 
15 expression vectors comprising a gene encoding for the expression of at least 
one o-macroglobulin, especially human Oj-macroglobulin or fragments or 
derivatives and variants thereof, or alleles of such a gene. 

Such vectors preferably further comprise regulatory elements 
necessary for the stable maintenance of said vector in mammalian cells. 
20 Also, such vectors may further include sequences providing for 

the processing and secretion of the expressed product. 

In relation to the use of recombinant or-macroglobulins, and 
especially rc^M, in growth media it may be co-expressed with another desired 
gene product, and consequently the vectors of the invention may further 
25 comprise one or more other genes encoding for a desired gene product. 

The invention further relates to transformed hosts comprising a 
functionally operative expression vector according to the invention compri- 
sing a gene encoding for the expression of human Cfe-macroglobulin or fragments 
30 or derivatives and variants thereof, or alleles of such a gene. 

The host may be selected from the group comprising a bacterial 
strain, a fungal strain, a mammalian cell line, or a mammal, especially a 
fungus, such as belonging to the genus Aspergillus , or a yeast strain, pre- 
ferably belonging to the genus Saccharomvces . 
35 Another preferred type of host is a mammalian cell line, 

preferably a Syrian Baby Hamster Kidney (BHK) cell line, and especially the 
one which is available from ATCC under No. CRL 1632. 
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The invention further relates to the recombinant human Og- 
macroglobulin or a variant thereof in an active form having the amino acid 
sequence of SEQ ID N0:2, or SEQ ID N0:4. 

5 APPLICATIONS OF ot-HACR06LOBULINS. ESPECIALLY rorJi. 

The present invention discloses applications of o-macroglobulins, 
and especially ro^M. These should be regarded not as limitations but as a 
few examples among many for the use of recombinant derived a-macroglobulins. 

10 or-MACROGLOBULINS AS CONSTITUENTS OF DEFINED GROWTH MEDIA. 

Degradation of specific heterologous products produced in either 
transformed or non-transformed mammalian cells is a potential problem in the 
production of recombinant products. This is due to the fact that many host 
cells secretes one or more different proteinases. 

15 When a production cell line is grown in the presence of e.g. 10 

% fetal calf serum, such proteolytic degradation of secreted recombinant or 
native protein products is a minor problem due to a buffering effect of the 
added serum proteins. 

However, the use of fetal calf serum in the large scale growth 

20 (fermentation) of mammalian production cell lines is not a desirable 
situation for a number of reasons. First of all fetal calf serum is a very 
costly constituent of complex growth media; second, the demand for fetal 
calf serum from a growing bi ©pharmaceutical industry might not be easily 
fulfilled in the future, and third, the use of fetal calf serum constitutes 

25a potential quality control problem in the production of pharmaceuticals 
intended for use in humans. 

To circumvent these problems, efforts can be expected in the 
field of development of defined growth media for use with mammalian cells. 

Addition of various proteinase inhibitors to such new defined 

30 growth media will be required to ensure the integrity of the secreted 
products. Alternatively, the producer cell line might, through genetic 
engineering, be endowed with the capacity to produce and secrete proteinase 
inhibitors along with the desired product(s). 

a-Macroglobulins, and especially Human o^M, are proteinase 

35 inhibitors of broad specificity, and they are therefore according to the 
invention used as constituents of defined growth media for mammalian cells, 
either as a medium additive or as a product co-produced with the desired 
product. 
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The target sites for a number of different proteinases, e.g. 
bovine trypsin, Streptomvces qriseus trypsin, papain, porcine elastase, 
bovine chymosin, bovine chymbtrypsin, Staphylococcus aureus strain V8 
proteinase, human plasmin, bovine thrombin, thermolysin, subtil isin Novo and 
5 Streptomvces qriseus proteinase B have been mapped in the bait region of 
human a^ (Mortensen, S.B., et al., (1981) FEBS Lett. 135: 295-300) and other 
a-macroglobulins (Sottrup- Jensen, L, Sand, 0., Kristensen, L. and Fey, G.H. 
J.Biol.Chem. 264 .15781-15789, 1989). It is evident that o^M and the other a- 
macroglobulins as proteinase inhibitors have broad specificities. 

10 In those situations, where the proteinase inhibitory spectrum 

of a a-macroglobulin, such as o^M, is not sufficient for the prevention of 
product degradation, it is possible through site specific mutation, protein 
engineering, etc. to change the proteinase inhibitor specificity of the a- 
macroglobulin, such as o^M. Incorporation of desirable specific proteinase 

15 target sites in the bait region of recombinant c^H will change the inhibitor 
specificity of the mutated aJA. Furthermore it is possible through genetic 
engineering to construct novel specific or general proteinase target sites 
in the bait region of a a-macroglobulin in order to enhance its versatility 
as a proteinase inhibitor of specific or broad inhibitory spectrum. 

20 Furthermore it is possible to remove specific target sites in an a- 
macroglobulin in order to avoid degradation of the variant in question by 
certain proteases in the circulation that will already be inhibited through 
the action of naturally present proteinase inhibitors. 

The production of recombinant products in fungi, such as species 

25 and strains of e.g. Aspergillus and Saccharomvces also meets with potential 
problems of product degradation. In some cases it is possible to isolate 
proteinase negative mutants of desirable production strains. This might not 
always be the case, and co-expression of a-macroglobulins, such as o^M or 
OjjM-mutants together with a desirable product may inhibit proteolysis of the 

30 product in question. 

g-MACROGLOBULIN MUTANTS AS SPECIFIC PROTEINASE INHIBITORS. 

The amino acid sequence of the bait region of a-macroglobulins 
defines the specificity of the a-macroglobulin towards different proteina- 
35 ses. A comparison of cleavage patterns for different proteinases and bait 
region sequences in five mammalian a-macroglobulins has recently been 
published (Sottrup- Jensen, L., Sand, 0., Kristensen, L. and Fey, G.H. The 
a-macroglobulin bait region. Sequence diversity and localization of cleavage 
sites for proteinases in five mammalian a-macroglobulins. J. Biol. Chem. 264, 
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15781-15789, 1989). It has previously been clearly demonstrated that the bait 
region in each species of or-macroglobulin is the major determinant of 
proteinase inhibitor specificity. The present invention demonstrates the 
possibility of modulating the inhibitor specificity of human a^M by 
5 alterations of proteinase target sites in the bait region. 

In the present invention it is demonstrated that the bait region 
of human o^M (residues 690 to 730 in SEQ ID N0:2) can be mutated at will to 
obtain a new proteinase inhibitor profile of this macroglobulin. The example 
presented in the present invention describes the construction of a hybrid 

10 macroglobulin. In this hybrid the bait region from human pregnancy zone 
protein (PZP) was introduced into human o^M, from which the native bait region 
had been removed. The hybrid molecule, which was constructed by the use of 
recombinant DNA technology, revealed a proteinase inhibitor profile similar 
to the inhibitor profile of PZP. 

15 The invention thus demonstrates the possibility to design and 

produce proteinase inhibitors with altered and new inhibitor specificities 
at will. 

This finding is important for the design of new proteinase 
inhibitors. Due to the low antigenicity the bait region in macroglobulins 

20 (Van Leuven, F., Marynen, P., Cassiman, J. -J. and Van den Berghe, H. Mapping 
of structure-function relationships in proteins with a panel of monoclonal 
antibodies. A study on human alpha-2-macroglobulin. J. Immunol. Methods 111 , 
39-49, 1988, and Delain, E., Barray, M., Tapon-Bretaudiere, J., Pochon, F., 
Marynen, P., Cassiman, J. -J., Van den Berghe, H. and Van Leuven, F. The 

25 Molecular Organization of Human alpha2-Macroglobulin. An Immunoelectron 
microscopic study with monoclonal antibodies. J. Biol. Chem. 263 . 2981-2989, 
1988) it is now possible, by the use of the technology described in the 
present invention, to design non-immunogenic new proteinase inhibitors that 
can be used e.g. in the treatment of any disease, where aggressive proteina- 

30ses constitute a threat to the health of man. 

In the present specification the production of ajft variants is 
described by the construction of a hybrid macroglobulin. It is clear to the 
skilled person in the art that changes also could be obtained through other 
genetic engineering methods, such as described in International Publication 

35 No. WO 89/06279 (NOVO INDUSTRI A/S). Also it is clear that other a- 
macroglobulins could be employed instead of the human OgM, such as those 
mentioned in Sottrup- Jensen, L. et al . (1989), supra . 
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r»M AS A PROTETN CARRIER IN FNZYME REPLACEMENT THERAPY. 

A different application of at^ is .its use as a carrier of macro- 
molecules such as proteins and nucleic acids. When a$ reacts with and forms 
a complex with a proteinase in solution, may bind other proteins (also 

5non-proteinase proteins) present in that solution (Salvesen, G.S. et al., 
(1981) Biochem. J. 195: 453-461). In the case of Fabry's disease, which is 
an X-chromosome linked disorder of glycosphingolipid metabolism, it has 
recently been demonstrated that o^M can function as a carrier in an in vitro 
model of enzyme replacement therapy (ERT) (Osada, T., et al., (1987) Biochem. 

lOBiophys. Res. Commu. 142: 100-106). aj\ was conjugated to coffee bean ot- 
galactosidase through the action of trypsin, and the formed complex was 
Internalized through o^M-receptor specific (Van Leuven, F; , et al., (1981) J. 
Biol. Chem. 256j 9016-9022) endocytosis and delivered to the lysosomes, which 
is the target organelle for ojl-receptor mediated internalization of o^M- 

15 proteinase complexes (Will Ingham, M.C. and Pastan, I., (1980) Cell 21: 67- 
77). 

Such a scheme in ERT provides a method of internalization to the 
lysosome of the enzyme in question and at the same time it might alleviate 
potential antigenicity problems arising from the use of heterologous enzymes 

20 in therapy. One limitation in this type of ERT (Osada, T., et al., (1987) 
Biochem. Biophys. Res. Commu. 142,: 100-106) would be the types of potential 
target cells that could be treated by this protocol. Obviously, they would 
have to express the OjM-receptor. In a future development of the system, the 
possibility might exist to redesign the cell specificity of o^M internal iza- 

25tion by exchanging the receptor binding domain of o^M with other receptor 
ligands. Hereby o^M-mutants could be designed to enter any cell type known 
to express a specific internal izable receptor. 

This type of development would of course require a system for 
the production of recombinant derived OjM. The use of native human c^M as a 

30 carrier in ERT (as described above) is undesirable due to the now well known 
risks of the employment of blood derived products in the treatment of human 
disease. 

The production of recombinant ojl in accordance with the present 
invention alleviates this problem by providing for large scale production 
35 of rotsM. 

raJA AS A DNA CARRIER IN G FNE THERAPY. 

Advances in gene transfer Into mammalian cells have opened for 
the possibility of the treatment of a number of genetic disorders through 
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gene therapy. A major problem in gene therapy will be the specific targeting 
of genes Into the appropriate cells within the body. (Williamson, B., (982) 
Nature £98: 416-418; Anderson, W.F., (1984) Science 226: 401-409; Parkman, 
R., (1986) Science 232: 1373-1378). 

5 It was recently described that a constructed foreign gene 

containing the chloramphenicol acetyl transferase (CAT) on a bacterial plasmid 
could be targeted to the liver of rats by specific receptor directed 
internalization (Ml, G.Y. and Wu, C.H. (1988) J. Biol. Chem. 263: 14621- 
14624) The DNA carrier consisted of a galactose-terminal (asialo)glyco- 

lOprotein and asialoorosomucoid covalently linked to poly-L-lysine. The 
polycation poly-L-lysine can bind DNA in a strong non-covalent and nondamag- 
ing interaction. It was demonstrated that complex bound DNA was internalized 
by cell-surface as ial oncoprotein receptors that are unique to hepatocytes. 
The complex was injected intravenously, and upon analysis only the liver 

15 expressed the CAT activity. • 

In the present invention the use of ratjM as a carrier of DNA in 
gene therapy is suggested. Reaction of rojl with a proteinase such as trypsin 
or with methylamine in the presence of covalently closed circular plasmid DNA 
is likely to result in partial or total entrapment of DNA within the 

20complexing a* molecule. After intravenous injection of such complexes with 
exposed receptor binding domains, the complex will be rapidly cleared from 
the blood and internalized in specific target cells, such as hepatocytes and 
Kupffer cells. Through protein engineering on the receptor binding domain of 
r0a M it will be possible to design a DNA carrier specific for other cell 

25 types. The advantage in this system as compared to the above described system 
using the asial oncoprotein receptor is, that it will not be necessary to 
identify different DNA carrier systems for each new cell type. 



30 EXAMPLES 

Materials and methods: 
Microorganisms and cell lines 



E co1i |U2 (MC1061) is available from e.g. Stratagene Inc., 
3511099 North Torrey Pines Rd., La Oolla, California 92037. 

HepG2 (Human hepatoblastoma cell line) is freely available from 
American Type Culture Collection, under No. HB 8065. 

BHK (Syrian Hamster Kidney cell line, thymidine kinase mutant 
line tk«sl3, (Waechter and Baserga (1982) Proc. Natl. Acad. Sci. USA 79: 
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1106-1110); is freely available from American Type Culture Collection, under 
No. CRL 163Z. 

Plasmids and vectors 

5 Plasmids pCDVI-PL and P SP62-K2 are available from Dr. Tasuku 

Honjo, Faculty of Medicine, Kyoto University, Kyoto 606, Japan. pSP62-K2 was 
deHvld from the plasmid P SP62-PL (available from New England Nucle r/ 
Pont (U.K.) Ltd., Wedgwood Way, Stevenage, Hertfordshire, SG14QN) as 

described Noma et al., (1986, Nature, U* 640-646) . pCDVI-PL was denv 
from pcDVl (Okayama, H. and Berg, P. (1983) Molec. cell. Biol. 3. 280-289) 
as described (Noma etal., (1986) Nature, 319:640-646). _ 

M13mpl8 is available from Pharmacia LKB Biotechnology (catalog 
# 27-1552-01) (Norrander, 0., Kempe, T. and Messing, J. Gene 26: 101-106, 

151983) * M13mpl9 is available from e.g. International Biotechnologies, 
Inc., P.O. Box 9558, 275 Winchester Avenue, New Haven, Connecticut 06535, 
USA 

pDHFR-I is available from Dr. K.L.Berkner, ZymoGenetics Inc., 
20 4225 Roosevelt Way NE, Seattle, Washington 98105. (The construction of this 
plasmid is given in detail in: Berkner, K.L. and Sharp, P.A. (1984) Nucleic 
Acids Res. 11: 1925-1941). The molecular cloning of the DHFR cDNA present 
in this plasmid, and its sub-cloning in mammalian expression^ vector, .under 
the control of adenovirus derived promoters has previously been described 
25 in detail (Chang, A.C.Y., et al., Nature 275: 617-624 and Kaufman R 1. and 
Sharp, P.A. (1982) Mol. Cell. Biol. 2: 1304-1319) . The backbone plasmid n 
pDHFR-I is PBR322 (Sutcliffe, O.G. (1979) Cold Spring Harbor Symp Quant. 
Biol. 43: 77-90; Sutcliffe, J.G. (1978) Nucleic. Acids Res. *™-*»\- 
~~ pUC13 is described in: Vieira, J. and Messing, J.: 1982, Gene 19. 
30 259-268 and available from Pharmacia LKB Biotechnology (catalog # 27-4954- 

Q1 * P UC19 is described in: Y an isch- Perron, C. and Messing, J., 1985, 

Gene 33:103-119 and available from Pharmacia LKB Biotechnology (catalog # 
27-4951-01). 

35 
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Growth media 
LB-broth: 

Mix 227 g Bacto Tryptone, Difco 0123-01 

113.5 g Yeast extract, Difco 0127-01, and 
5 227 g NaCl in a sealable plastic container. 

Add 12.5 g mix to 500 ml water in a 1000 ml bottle, shake well and sterilize 
in an autoclave. 

Dulbeccos Modified Eagle Medium is available from e.g. Gibco Ltd. 
10 P.O. Box 35, Trident House, Renfrew Road, Paisley PA34EF, Renfrewshire, 
Scotland. Cat.# 042-250 1M (10 * concentrate). 

Antibodies 

15 Anti-OsW A033 and peroxidase conjugated anti-ogi PE326 were from 

DAKOPATTS A/S, Copenhagen, Denmark. 

EXAMPLE 1. 

CI ONING AND SEQUENCE DETERMINATION O F HUMAN aM 

20 

Preparation of messenger RNA from the hu man cell line HeoG2. 

The human hepatoblastoma cell line HepG2 (American Type Culture 
Collection No. HB 8065, freely available) was used as a source for mRNA 
preparation. HepG2 cells were grown to a total cell number of 15 * 10 7 in 
25Dulbecco's Modified Eagle medium containing 10% fetal calf serum and 
antibiotics. 

Total RNA was isolated by the guanidinium thiocyanate method 
(Chirgwin et al., (1979) Biochemistry 18: 5293-5299) and purified by CsCl 
gradient centrifugation. A total of 3000 jig RNA was obtained. mRNA was 

30 isolated by use of an oli go (dT) -cellulose column (Aviv & Leder (1972) Proc. 
Natl. Acad. Sci. USA 62: 1408-1412). 60 jxg of mRNA was obtained after one 
cycle of affinity chromatography. After ethanol precipitation, this 
preparation of mRNA was resuspended in 10 mM Tris-HCl pH 7.5, 0.1 mM EDTA- 
Na 2 at a final concentration of 1 /ig//il and stored at -80°C for subsequent 

35 use in the construction of a cDNA library. 

rnnstrnrt.inn of a cDNA library from HepG2 mRNA. 

A cDNA library was constructed in the pCDVI-PL/pSP62-K2 vectors 
(Noma et al., (1986) Nature, 319: 640-646, Available from Dr. Tasuku Honjo, 
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Faculty of Medicine, Kyoto University, Kyoto 606 Japan, by use of the 
me thods described by Okayama * Berg (Mol. Cell. Biol. 2: 161-170 (1982), 
Mol. Cell. Biol. 3: 280-289 (1983)). 

E coll K12 (MC1061) (Casadaban & Cohen (1980) J. Mol. Biol. 
5138- 179-207) was used for transformation. MC1061 were grown in L-broth at 
a7c to 0^=0.5. Twenty ml were centrifuged, and the pellet was resuspen ed 
in 7 ml of ice-cold sterile 0.1 M CaCl a , incubated on ice for 30 minutes, 
centrifuged briefly, and finally kept in the cold room overnight 

Ninety-five /il suspension of transformation-competent E^coll 
10MC1 061 were added per 10 Ml of cDNA preparation. The mixture « 

on ice for 30 minutes, heat-shocked at 43,5'C for 45 seconds, and finally, 
after addition of L-broth, incubated at 37'C for 30 minutes. 

After resuspension, the cells were plated onto L-broth plates 
containing ampicillin (50 „g/ml) and grown for 8 hrs at 37'C. A total of 2.9 
15*10* individual colonies could be obtained from this library. 

nf the Hepc lihrarv fo" mNA r.lones encoding human pJIL 
5 * io 4 individual colonies were screened by standard colony 
hybridization technique using nitrocellulose filters (Maniatis et al , (1982) 
ZOMolecular Cloning - A Laboratory Manual, Cold Spring Harbor, New York). 
A 20-mer oligonucleotide mixture 
5' CC(T/C)TTCAT(6/A)TC(T/C)TC(T/C)TC-(T/C)TT 3' 
where the notation (X/Y) -eons that oUhor of tho nucleic adds X or Y jay 
be used, co.plenent.ry to the hu.au -A in the reg,on 
25 acid residues Lys-Gln-Glu-Asp-Ket-Lys-Gly (residues nurter 493 - 499 in 
Sottrrp-densen et al., 0. Biol. Ch«. m 8318-8327 (1984) synthesized 
(on a m synthesizer fro. Applied B1os,st»s, USA), labelled w,th «P (using 
T. polynucleotide kinase and 7 -"2-ATP) to a specific activity of 3 10 
c*».1 oli8.nucle.tlde. The labelled oligonucleotides were pun , by gel 
,0 chronograph, and subsequently used in the "^*«L » «*J 

The hybridization solution contained 6 * SSC, 5 * Denhardt s 
solution, 0.05* SDS (Maniatis et al., (1982) Molecular Cloning 
Manual , Cold Spring Harbor, New York) and lC cpn^nl of labelled ol.go- 

nucleotide mix. _ 
35 Hybridization was performed for 3 hrs at 45'C. Then the filters 

were washed in 5 * SSC, 0.05% SDS at 45'C for 3 * 10 minutes. After-autora- 
diography the filters were washed under the same conditions, but this time 
at 52-C. A colony that still showed hybridization at this temperature was 
isolated and the cDNA insert of the corresponding plasmid (designated po^) 
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from this isolate was sequenced (Tabor & Richardson (1987) Proc. Natl. Acad. 
Sci. USA 84: 4767-4771). The sequence of the cDNA and the derived encoded 
amino acid sequence are shown in the appended sequence listings, SEQ ID 
N0:1:, and SEQ ID N0:2:. 

5 

Characterization of PttJlL 

p0a M had a cDNA insert of approximately 4.6 kb. Its sequence is 

given in Table I above. 
10 The sequence in Table I demonstrates that the entire coding 

region of aj\ including the signal peptide 1s found in the insert. 

In addition to the coding region, the insert contains sequences 
derived from the 5'- and 3' untranslated regions of the o^M mRNA molecule. 

The amino acid sequence of the human o^M as deduced from the cDNA 
15 in P02 M is in total agreement with the published sequence (Sottrup-Oensen et 
al., (1984) J. Biol. Chem. 259j 8318-83Z7). Codon number 1000 (numbered from 
the initiating methionine codon in the signal peptide) was found to be ATC 
encoding an isoleucine and not GTC (encoding a valine) as found in an cDNA 
synthesized from human liver mRNA (Kan et al., (1985) Proc. Natl. Acad. Sci. 
20 USA. 82: 2282-2286). In the 0a M cDNA sequence from the HepG2 library we have 
further identified ten silent changes as compared to the sequence from the 
liver library, see the following Table I: 
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TABLE I 



Liver HepG2 





413 (Asn) 


Aftf 
nnv 


AAT 




ant / nUn \ 

495 (Pnej 


TTT 


TTC 


10 


750 (Gly) 


GGG 


GGT 




796 (Leu) 


CTT 


CTC 


15 


835 (Leu) 


err 


CTA 




1266 (Ala) 


GCC 


GCA 




1296 (Asn) 


AAT 


AAC 


20 


1326 (Thr) 


ACC 


ACA 




1442 (Leu) 


CTC 


CTG 


25 


1460 (lie) 


ATC 


ATT 



The position of the oligonucleotide mixture used as a hybridiza- 
tion probe in the colony screenings was fro. position 1574 ^ ^» ^. 
30 and the position of the reactive thiol ester is from posits 2939 to 2953 



in SEQ ID N0:1 
EXAMPLE 2. 

35 



Construction of a mammalian expression vector for sJL ' 

pojl was digested (fig. la) with Xbal and EcoRI, and a 1.2 kb 
fragment containing the 5' part of the c# cDNA together with the ^multiple 
cloning site of P SP62-K2 was isolated on an agarose ge and cloned n J 
Xbal/EcoRI digested M13 m pl9 vector to generate M13mpl9A. To facilitate 
^th^subclonings of the ^ cDNA, a unique EcoRV site was introduced in 
40the 1.2 kb fragment 10 nucleotides 5' to the initiating ATG ^h = 
codon through site directed mutagenesis (Kunkel et al (1987) Method 
Enzymol. 1M: 367-382). In the same mutagenesis experiment, in which the 

mutagenic oligonucleotide N0R593: 

5 ' (TTCTTCCCCATGGTGGATATCGAAGGAGCTG) 3 ' 

45was used, the 5 nucleotides 5' to the methionine codon « to 
CCACCAJG; this mutation creates a new ficfil site spanning the ATG codon. A 
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correct mutant M13mpl9B was identified through restriction enzyme digestion 
and DNA sequencing. 

The mutated 5' end of OjM cDNA was isolated from M13mpl9A repli- 
cative form through digestion with Hjndlll and £coRI and agarose gel electro- 

5 phoresis. The isolated DNA fragment was then joined to mndlll/EcoRI digested 
pOjM through ligation to generate pll36. In this plasmid the o^M cDNA is 
reassembled in its total length, but now with a unique fcoRV site at the 5' 
end. pll36 was digested with ££O.RV/DraI, and the o^M fragment was isolated on 
an agarose gel and cloned in a mammalian expression vector under control of 

10 the adenovirus 2 major late promoter (Ad 2 MLP). 

The adenovirus -promoter based vector was constructed by K.L.Berk- 
ner (ZymoGenetics Inc., Seattle, WA.), and a detailed description of the 
functional elements in the mammalian expression vector is given in: Powell, 
J.S. et al., (1986) Proc. Natl. Acad. Sci. USA 83: 6465-6469 and in: Boel 

15 et al., (1987) FEBS Lett. 219: 181-188). 

The expression vector used for expression of human OjM was 
generated from the mammalian expression vector pPP (Boel, E. et al., (1987) 
FEBS Lett. 219: 181-188), in which human pancreatic polypeptide cDNA was 
cloned under control of Ad 2 MLP. 

20 pPP was digested (fig. lb) with BamHI and the resulting stag- 

gered ends were repaired with DNA polymerase (Klenow fragment and the four 
deoxynucleotide triphosphates). The 4.5 kb EcoRV/Dral o^M cDNA fragment was 
joined to this vector through ligation, and correct recombinants were 
characterized through restriction enzyme analysis on isolated miniprep. 

25plasmids. 

The OjM-mRNA transcribed from the resulting 8.76 kb plasmid 
(designated pll67 (fig. 2)) has the adenovirus 2 late tripartite leader (Ll- 
3) at its 5' end together with an mRNA splice signal (SS). At the 3' end of 
the construct the transcript is terminated with the SV40 late termination - 
30 and polyadenylation signal. 5' to the Ad 2 MLP the construct includes the 
SV40 enhancer (ENH) and the 0 to 1 (0 - 1) map units from adenovirus 5. 

Expression of aM in mammali an cells. 

For expression of human ojl in cultured BHK cells (Syrian Hamster 
35 Kidney, thymidine kinase mutant line tk*sl3, (Waechter and Baserga (1982) 
Proc. Natl. Acad. Sci. USA 22: 1106-1110); American Type Culture Collection 
CRL 1632) the expression vector pll67 was co-transfected with pDHFR-I (Berk- 
ner, K.L. and Sharp, P.A. (1984) Nucleic Acids Res. 12: 1925-1941. Available 
from K.L.Berkner, ZymoGenetics Inc. Seattle) into subconfluent cells by the 
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days, individual colonies were clone 

H r »i mqas. Scand. J. Clin. Lab. invest. 22. 

lO(ELISA) (Munck Petersen C, et al., (1985) beano. 

735-740) against human oji secreted to the growth medium. 

The materials used in the EL ISA were: 

15 Catching antibody A033 anti-o^M, 

Peroxidase-conjugated anti-o^ antibody PE326, 
1,2-Phenylenediamine, di hydrochloride (OPD) 
all from DAKOPATTS A/S, Copenhagen, Denmark. 
Urea peroxide, 125 mg, was from Organon Teknika. 

20 96 well ELISA plates were from NUNC, Copenhagen. 

Coating buffer: fallows- 
100 mM carbonate buffer P H 9.6 was made up as follows. 

Add 3.18 g Na 2 C0 3 and 5.96 g NaHCO, to 1000 ml water. 

25 

Standard and sample buffer: 

To 100 ml of 150 mM phosphate buffer pH 7.2 was added. 

50 /il Tween 20 

2 g Bovine Serum Albumin (Sigma A 7030). 

30 

Washing buffer: 

10 mM sodium phosphate pH 7.4 
145 mM sodium chloride 
0.1 % Tween 20. 



35 



Citric acid-phosphate buffer, pH 4.9: 

The following reagents were added to 1000 »1 of water 



7.3 g citric acid 
23.88 g Na^PO,, 12 H£ 
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The buffer was used for a maximum of 14 days, stored at 4*C. 

Urea peroxide solution: 
5 125 mg urea peroxide was dissolved in 8.93 ml water. 

The solution was kept in the dark at 4T. 

Coating of the plates for assay: 

The 96 well plate was coated with 175 Ml of the OAKO A033 
10 antibody diluted 1:1000 in the coating buffer. The plate was incubated over 
ight at 4-C. Before use the plate was washed 4 times in washing buffer. 



m< 



Application of standards and samples: _ 

100 Ml standard or sample was added to each well. As a standard 

ISpurified human OsM, 2 mg/ml (prepared as described in: Sottrup-Jensen et al 
(1983) Ann. N.Y. Acad. Sci. 421: 41-60) was used. The standard curve included 
the following serial dilutions: 1:4000, 1:8000, 1:16000 etc. down to 
1:1024000, corresponding to final concentrations from 500 /ig/1 down to 1.95 
ug/1 All dilutions were done in the Standard and sample buffer. The plate 

20was incubated over night at 4'C and then washed 4 times with wash buffer 
before the next step. 

Addition of conjugated antibody: 

100 Ml of PE326, which had been diluted 1:6000 in the Standard 
25and sample buffer, was added to each well. The plate was incubated for 2 h 
at 20'C, and then washed 4 times with wash buffer. 

Enzyme activation: 

8 mg of OPD was dissolved in 12 ml of Citric acid- phosphate 
30 buffer. To this solution 500 M l Urea peroxide solution was added and the 
mixture was used immediately. 100 M l of the final solution was added to each 
well, and the plate was incubated in the dark for 6 min. Then 100 Ml of 2 H 
H^O, was added to each well and the A« was read in an automated ELISA plate 
reader. 

The above described ELISA did not give any background on medium 
supplemented with 10% FCS, nor did it give any background in BHK cell 
conditioned medium. Of 24 isolated MTX resistant clones, 16 produced 
detectable amounts of recombinant OjM. 
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Selected cell lines that secreted 12.3 mg/1 (K16-6) and 19 1 

hour period were expanded for large scaie pro 
(rojM). 

5 -~^^ „re each ox^--- 

confluenc, the me.hu. on the cells «. ^ ^ 

lOcalf serum (FCS) do- to 2%. After « hour, of 9 senM 

b? The^led conditioned medium fro» K16-6 and fro. K17-6 contained 

2 „,.,„od,.c.tlc acid Sephar.se 4B .Porat , ^.jp^^ « 
599) equilibrated ntk 2 * 2 until V 0.036. A second wash -1th 

phosphate ^*~™%*T?~t«>~* «- < °- 0M - 
20 off sodium phosphate, 500 n*1 Had pH was v 

fractions were collected^ ^ _ „„ 

on an^lcon devise equipped with ' 
then loaded onto a Soporose 12 gelflltration column 25 H Tm HC1, 150 
30Nad PH 8.0). The ^ contalnln, fractions were pooled and stored at -20 
until analysis. 

EXAMPLE 3. 

rK a y. a rtpri7ati - of yprombinant. human re^ 

35 tho thiol «+ 0 r- thgrjn aj fomentation and 
a rhomiral E SaSfejOBS at thP thin) esier. 

mo fh Y 1 amine ifldj&gd, Haavaae. 
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A number of different analyses were performed to evaluate the 
structural and biological characteristics of the human ro^ as compared to 
a preparation of human plasma derived o^M, designated preparation LSJ39. 

An important structural feature of aj\ is the presence of the 
5 thiol ester. When heated to 95'C for 15 min, the thiol ester will induce « 
peptide bond cleavage in the backbone of at the position of the thiol 
esterified Glx-residue. This results in the fragmentation of the 180 kD afl 
monomer into two polypeptides of 120 kD and 60 kD. Fig. 3 shows an analysis 
of both the purified rafl (from two transformed BHK cell lines) and the 
lOpurified human plasma derived preparation LSJ39 on a 10-20% SDS polyacryl- 
amide gel. The different preparations, either native human or BHK cell 
derived recombinant c# were all heat treated to induce thermal fragmenta- 
tion before loading onto the gel. Molecular weight markers (from top to 
bottom: 180, 120, 92, 60, 43, 26, 14 and 6 kD) were applied to lanes 1 and 
15 8. Samples in lanes 2, 3 and 4 were not reduced before electrophoresis, while 
samples in lanes 5, 6 and 7 were reduced. Preparation LSJ39 was applied to 
lanes 2 and 5. rojl K16-6 was applied to lanes 3 and 6, and ror^ K17-6 was 

applied to lanes 4 and 7. 

It was clear from the patterns of protein fragments on the gel, 

20 that both human ^ and the two ro^ preparations showed a considerable degree 
of thermal fragmentation. As expected, only the reduced samples displayed 
this fragmentation. In the nonreduced samples, the molecules migrated as the 
360 kD dimer. 

In the human plasma derived preparation LSJ39 (lane 5) a fragment 
25migrating slightly faster than the 60 kD fragment could be observed. Lanes 
6 and 7 indicated the presence in the recombinant material of a similar 
faster migrating fragment. It is possible that this fragment represented a 
slightly underglycosylated variant of the 60 kD fragment. 

Methylamine (MA) and other small nitrogen containing nucleo- 
30philes will cleave the thiol ester and thereby inactivate the ester (Sottrup- 
Jensen, L., et al., (1980) FEBS Lett. 121: 275-280; Salvesen, G.S. et al., 
(1981) Biochem. 0. 1S5_: 453-461). After MA induced inactivation of the thiol 
ester, thermal fragmentation of o,M can no longer be observed. 

Fig. 4 shows a SDS-PAGE run similar to that shown in Fig. 3 (with 
35 respect to loaded samples), in which applied and ro^M had been pretreated 
with MA. From this gel it was concluded, that the thiol ester of r^M was just 
as susceptible to cleavage with MA as the thiol ester of native <gi. Upon 
reduction MA-treated o^M and migrated as a single 180 kD monomer species. 
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Lanes 5 of both Fig. 3 and 4 shoved an additional band of 
approximately 85 .». When « is cleaved in the bait region by proteinases 
present in the blood, it generates two fragments, each with a molecular 
Z t o 85 tt. The human a* preparation LSJ39 (purified from serum 

5con ained these cleavage products, while they could not be detected on t 
gel in the two r^M preparations. This indicated that the material ^secre 
from the transformed BHK cell lines was largely native uncompleted ajt Any 
* molecules, that have reacted with proteinases are inactiva ed a nd 
not form additional complexes with other proteinases. Since the ^BHK ce 11 

lOdoes not produce any proteinases that forms complexes with the , 
this cell is therefore well suited for production of recombinant human a,M. 

R Rpartion w jjb, trypsin. , BV • 

React on with trypsin is a standard way of analyzing the proteinase -complex 
ability^ (Sottrup-Jensen, L. (1987) in: "The Plasma Protein, 
(Putnam, F.W., ed.) 2nd Ed., 5: 191-291, Academic Press Orlando I , Har e , 
P C (1973) J. Exp. Med. 138: 508-521; Harpel, P.C, et al., (1979) J. Biol. 
Cnem. 2J*: 8869-8878; Swenson, R.P. and Howard, J.B. (1979) J. Biol. Chem 
254: 4452-4456). In this reaction trypsin will -cleave at its target siteOO 
ZO^Tthe bait region of a,M, and the resulting reduced cleavage products (85 kD) 
will migrate as a double band. Under nonreducing conditions the trypsin-^ 
complexes will migrate as high molecular weight products. 

Fig 5 shows the result of such an analysis (performed as 
described (Sottrup- Jensen, L. (1987) in: "The Plasma Proteins" (Putnam, FJi.. 
25ed.) 2nd Ed., 5: 191-291, Academic Press, Orlando, FL; Harpel PC (1973 
J. Exp. Med. 138: 508-521; Harpel, P.C., et al., (1979) J. Biol, hem m 
8869-8878; Swenson, R.P. and Howard, J.B. (1979) J. Biol. Chem 254_: 4452- 
4456)) on the native human oji preparation LSJ39 (lanes 2 and 5) and on rc# 
from cell lines K16-6 (lanes 3 and 6) and K17-6 (lanes 4 and 7). The ^samples 
30in lanes 2, 3 and 4 were not reduced before electrophoresis, while the 
Mam ies in lanes 5, 6 and 7 were. Lane 5 shows that almost all of the human 
native was cleaved with trypsin, while the two preparations of rojl were 
cleaved Tith an efficiency of approximately 80% or more. Without reduction 
of the complexes no low molecular weight products from the reaction between 
35 trypsin and the native ^ or the BHK cell derived rojl were seen on the gel 
The 85 kD fragments derived from the recombinant material migrated somewhat 
faster than the human standard; as mentioned above the recombinant materi- 
al might be slightly underglycosylated. 
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When OjM is reacted with methyl amine, the thiol ester will be 
inactivated, and afl changes conformation from the "slow" form to the "fast- 
form (Sottrup- Jensen, L. (1987) in: The Plasma Proteins (Putnam, F.W., ed.) 
2nd Ed., 5: 191-291, Academic Press, Orlando, FL; Van Leuven, F., Cassiman, 

5 J. -J. and Van Den Berghe, H. (1981) J. Biol. Chem. 256: 9016-9022). In this 
conformation it can no longer react rapidly with or form complexes with 
proteinases such as e.g. trypsin. 

Fig. 6 shows the results of a set of experiments that were run 
in parallel to the experiments described above and shown in Fig. 5. However, 

10 before reaction with trypsin the native human otjM and the ro^M used in this 
experiment had been treated with methyl amine (Sottrup-Jensen, L., et al., 
(1980) FEBS Lett. 121: 275-280). Under these conditions both the native ajl 
and the rc^M show a marked decrease in reactivity towards trypsin (80% or 
more of the a# and raS monomers were migrating as a 180 kD polypeptide). 

15 This indicates that trypsin does not rapidly cleave at the bait region in 
methyl amine treated human a# or in BHK cell derived rOjM. 

In these types of experiments BHK cell derived ra# has shown 
characteristics similar to those of native human a^M. 

20 C. Trypsin and methvlamine induct conforma tional chanoe in 

As mentioned above the molecule will undergo a conformational 
change both through complex formation with proteinases and through methyl - 
amine induced cleavage of the thiol ester. The change in structure results 
in an altered mobility on rate gels (Sottrup-Jensen, L. (1987) in: The Plasma 

25Proteins (Putnam, F.W., ed.) 2nd Ed., 5_: 191-291, Academic Press, Orlando, 
FL; Van Leuven, F., Cassiman, J.-J. and Van Den Berghe, H. (1981) J. Biol. 
Chem. 2§6_: 9016-9022); unreacted aJA will migrate as a "slow" form, while 
reacted ojl will migrate as a "fast" form. 

Fig. 7 and Fig. 8 show these conformational changes, as they 

30 appear after reaction with trypsin and methylamine, respectively (analyzed 
on 5-10% rate gels). 

Lanes 1 on both gels contain purified human pregnancy zone 
protein (PZP) (Sand, 0, et al., (1985) J. Biol. Chem. 260: 15723-15735), 
which is known to appear in both a dimeric (D) and a tetrameric (T) 

35 configuration. 

Lanes 2 on both gels contain unreacted human aJA preparation 
LSJ39. Lanes 3 on both gels show the fast migrating form, resulting from 
reaction with trypsin and methylamine, respectively. Lanes 4 on both gels 
show the unreacted rOjM preparation K16-6, and lanes 5 show the corresponding 
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fast forms. Lanes 6 on both gels show the unreacted ro^M preparation K17- . 
6, and lanes 7 show the corresponding fast forms. 

It can be concluded that both complex formation between raj\ and 
trypsin and reaction of r^ with methylene result in the appearance of 

5 fast migrating structures. These structures appear (as analyzed on rate gels) 
to be very similar to the structures obtained when human o,M was allowed to 
react with trypsin and methylamine. It is also evident from these figures 
that the r*# proteins showed a migration, which, when compared to the 
migration of dimeric and tetrameric PZP on the gels, is in agreement with the 

10 finding that these molecules are produced and secreted from the BHK cells in 
the active tetrameric conformation. 

n Chromatogra phy of aM on a Superose 6 column. 

A Superose 6 column can partially resolve c& molecules in the 

ISdimeric configuration from molecules in the tetrameric configuration 
(Sottrup-Jensen, L. unpublished). Human standard and ro,M was analyzed 
on a 24 ml Superose 6 column (buffer: 25 mM Tris-HCl, 125 mM NaCl P H 8.0; 
flow rate: 1 ml /min; fraction size: 1 ml). Fig. 9 shows the diagrams obtained 
from the chromatography of purified human standard a* and ro^ from the K17- 

206 and the K16-6 BHK cell lines. Tetrameric (Sottrup-Jensen, unpublished 
observation) will elute in fraction 12 on this type of column. It is evident 
from the chromatograms that both of the rctf preparations eluted in fraction 
12 as did the human standard o^. On this type of column, dimeric o^M 
molecules will elute in fraction 14 and 15 (Sottrup-Jensen, unpublished 

25 observation). This type of analysis supported the results obtained from the 
rate gels (Figs. 7 and 8), that rctfl was secreted from BHK cells in a 
tetrameric configuration. 

F Trypsin retortion analysis. 

30 When trypsin is trapped inside the a# molecule, it retains its 

catalytic capacity towards low molecular weight substrates such as S-2222 
(N-benzoyl-L-Ile-L-Glu-Gly-L-Arg-p-nitroanilide). If trypsin is efficiently 
complexed with it will be protected against high molecular weight 
inhibitors such as Soybean Trypsin Inhibitor (STI) (Sottrup-Jensen, L. (1987) 

35 in- The Plasma Proteins (Putnam, F.W., ed.) 2nd Ed., 5_: 191-291, Academic 
Press, Orlando, FL; Ganrot, P.O. (1966) Clin. Chim. Acta, li*- 493-501; 
Sottrup-Jensen, L. et al., (1981) FEBS Lett. 128: 127-132). 

K16-6 and K17-6 derived ro^M was compared with human plasma a& 
in such a protection assay. 100 ,1 (in 25 mM Tris-HCl, 125 mM NaCl, P H 
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8.0) was mixed with 30 /il trypsin (0.5 mg/ml in 20 mM sodium acetate pH 5.0). 
After incubating for 2 min. 30 ftl 1 mg/ml STI (in PBS) was added. 10 /il ali- 
quots were removed after 2 and 4 min. and each mixed with 750 /il 0.12 mM S- 
2222 (dissolved 0.1 M sodiumphosphate pH 8.0, 5% dimethyl sul foxide) . 
5 The change in absorbance at 405 nm was recorded for 2 min. The 

results of the assay are given in the following Table II: 



TABLE II 



15 



Prep, of OjM. 


o^M in cuvette. 


Activity. 




Aaa/min 


m 


A^min/^g 


Human LSJ39 


0.140 


5.00 


0.028 


K16-6 


0.111 


4.62 


0.024 


K17-6 


0.119 


4.87 


0.024 



20 

From these results it can be concluded that rotjM had essential- 
ly the same protection capacity for trypsin against STI as compared with the 
protection capacity of human plasma oji. 

If aJA is treated with methylamine before the protection assay, 
25 the protection capacity drops dramatically. In a similar assay as that 
described above, methylamine treated human plasma o^M only retained 17% of 
its protection capacity, while K16-6 and K17-6 rOjM retained 16% and 14% 
respectively. It can be concluded that rajl protected trypsin against STI 
with almost the same efficiency as did human plasma a^M. 

30 

E. Amino terminal amino acid s equencing of raJI. 

Theoretically, the aJA characterized in the present investiga- 
tion could only be either bovine (contaminant from serum), from hamster 
(endogenous product from the BHK cell) or derived from expression of the 

35 transfected plasmid pll67. The EL ISA assay used never recognized any osjH in 
BHK cell conditioned medium, whether with or without added fetal calf serum. 
To make sure that the investigated o^M was human o^M, and to characterize the 
amino terminal processing of the recombinant product, amino terminal amino 
acid sequence determination was carried on out K16-6 and K17-6 ra^M as 

40 described (Sottrup-Jensen, L. et al., (1984) J. Biol. Chem. 253: 8293-8303). 
The Edman degradation was repeated for 12 cycles, and the identity of the 
detected amino acid derivative in each cycle, was in total agreement with the 
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amino terminal sequence of human Ser-Val-Ser-Gly-Lys-Pro-Gln-Tyr-Met- 
Val-Leu-Val-, whereas bovine a$ has the following amino terminal sequence: 
Ala-Val-Asp-Gly-Lys-Pro-Gln-Tyr-Met-Val-Leu-Val- (unpublished, Dr. Torsten 
Kristensen, Department of Molecular Biology, University of Aarhus, Denmark.) 

5 

EXAMPLE 4. 

rnn<tmrt.inn an* a y pra«inn of a halt, region mutant, of human o^. 

In the present example it is demonstrated that the bait region 
of human o* can be substituted by the bait region of human pregnancy zone 

lOprotein (PZP) (Sottrup Jensen, L., Folkersen, J., Kristensen, T. and Tack, 
B.F. Partial primary structure of human pregnancy zone protein: extensive 
sequence homology with human alpha 2-macroglobulin. pror. Natl. Acad. Sci A 
U.S.A. 81, 7353-7357, 1984; Sand, 0., Folkersen, J., Westergaard, J.G. and 
Sottrup Jensen, L. Characterization of human pregnancy zone protein. 

15 Comparison with human alpha 2-macroglobuHn. J^Bjol^Chegu 2§0_, 15723-15735, 
1985). The resulting o^M bait region mutant exhibited a proteinase inhibitor 
profile similar to that of human pregnancy zone protein. 

To facilitate substitution of DNA fragments encoding the bait 
region of human aj\ cDNA, target sites for the restriction enzymes PstI and 

20SacII were introduced at the 5' and at the 3' end of the cDNA region encoding 

the bait region. 

The human o^H expression plasmid pll67 was digested with BamHI and 
Clal, and a 2660 bp fragment, which carried the central part of the human 
aj\ cDNA, was subcloned in the fiamHI and CJiI digested vector pSX191. 
25 This vector, which had previously been constructed, is a 

derivative of P UC19. It was constructed as described: pUC19 was digested 
with EcoRI and HindHI, and a synthetic linker with the following sequence 

Kpnl £stl EcoRI Hind3 Clal Sfihl EmHI 
30MTTGCTACCCTGCAGGMTTCAAGCTTATCGATGGCATGCGGATCC - N0R781 

CCATGGGACGTOT - N0R78Z 



was cloned in the digested pUC19 vector. The linker, which was an annealing 
product from the two synthetic oligonucleotides N0R781 and N0R782, has 
35cohesive ends that will ligate to the £coRI and the HindHI sites of pUC19 
in such a way that these ligation sites are not regenerated in the pSX191 
vector. Thus pSX191 carried sites for JCpjiI, PstI, EcoRI, HindHI, Clal, SphI 
and BamHI . 

The resulting plasmid pSXigio^M was digested with BjjHI and 
40HJndIII, and a purified 2.6 kb BjjgHI/MndlH fragment was cloned in 
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M13mpl8 to generate MISmplSataM for mutagenesis by described methods. A 
synthetic oligonucleotide N0R973, with the following sequence: 

5' (TTCATACTGCTGCAGCTGT6GACAC ) 3 ' 
was used to introduce a £stl site at position 2102 (SEQ ID N0:1) in the cDNA 

5 sequence, and a oligonucleotide (N0R974) with the following sequence: 

5' (AGCCACCC££GCGGAGTTTACCAC)3' 
was used to introduce a SacII site at position 2271 (SEQ ID N0:1) in the 
cDNA sequence. These sites were chosen because they did not introduce 
alterations in the encoded amino acid sequence, and they were within a 

10 convenient distance of the bait region in human a^M cDNA. Both primers were 
used in the same mutagenesis experiment (Kunkel, T.A., Roberts, J.D. and 
Zakour, R.A. Rapid and Efficient Site-Specific Mutagenesis without Phenotypic 
Selection. Methods in Enzvmol. 154 . 367-382, 1987); dsDNA was isolated from 
mutated MISmplSctjM plaques, and the DNA was digested with the restriction 

15 enzymes PstI and SacII. Correctly mutated recombinants, which had an insert 
of 160 bp, were further analyzed by DNA sequencing (Tabor, S. and Richardson, 
C.C. DNA sequence analysis with a modified bacteriophage T7 DNA polymerase. 
Proc. Natl. Acad. Sci . U.S.A. 84, 4767-4771, 1987). A 2.6 kb BMnHI/Hjndlll 
fragment from a correct aS cDNA mutant (M13mpl8o^M#212.1) was subcloned in 

20a BamHI/HindlH digested pUC13 vector, and a correct subclone pl308 was 
isolated and characterized with BamHI/HindlH and Pstl/SjcII double 
digestions and DNA electrophoresis. 

The Pstl/SacII fragment in pl308 can be excised and replaced 
with a different DNA fragment, which encodes bait region variants. The 

25 resulting new variants (bait region mutants or analogs) of o^M cDNA can be 
isolated as BamHI/Clal fragments and subcloned back into BamHI/Clal digested 
expression vector pll67. 

In the present example DNA encoding the amino acids of the bait 
region for human PZP (Sottrup-Jensen et al. 1989, supra ) was obtained from 

30 ligation, annealing and cloning of 8 synthetic oligonucleotides. 

The DNA sequence of the synthetic fragment and the encoded amino 
acids as inserted into the ctjM clone are given in SEQ ID N0:3, and comprises 
positions 2107 to 2305 and the corresponding amino acids. A P_stl site was 
introduced at the 5' end in the synthetic fragment, and SacII and BamHI sites 

35 were introduced at the 3' end. 

This synthetic 0.2 kb DNA fragment was cloned in a Pstl/BjmHI 
digested M13mpl8 vector for DNA sequencing. DNA from a clone containing the 
correct sequence was digested with Pst I and SacII. and the purified 0.2 kb 
fragment was cloned in a Pstl/SacII digested and gel purified pl308 vector. 
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A correct recombinant, P 267PZP, was characterized with restriction enzyme 
digestions, and from this plasmid, bait region mutated (a* - PZP) cDNA was 
isolated as a 2.7 kb BjmHI/Clal fragment and subcloned in a BaiHI/CJiI 
digested afl expression vector pll67. The resulting plasmid, designated P 1365, 
5was grown as a large scale plasmid preparation, purified by CsCl centrifuga- 
tion, and cotransfected with pDHFR-I into BHK cells. 

Through thi s .procedure the nucleotides 2102 to 2275 in SEQ ID 
N0:l was removed and replaced with nucleotides 2102 to 2305 in SEQ ID N0:3. 

The procedures for transfection, selection of bait region mutated 
10<gf (designated roJI-PZP) recombinants (with an <gt specific ELISA), large 
scale production and purification of mutated c& were as described elsewhere 
(EXAMPLE 2) in this application. 

C^arjctgjjzjt i QD Pi the nrnteinase inhibitor specificity of I bait region 

15 mutant o f human aJI. 

The purified recombinant o^M mutant, ro^M-PZP, was characterized 
with respect to its inhibitor specificity profile against various proteina- 
ses by the use of previously described methods (Sand et al.1985). For 
comparison human plasma derived c& and PZP were treated with the same set 
20 of proteinases in parallel reactions. The proteinases used were chymotryp- 
sin, elastase, trypsin and SJapJivlococcus aureus Glu-specific proteinase. 
It has been reported (Sand et al.1985) that chymotrypsin and elastase show 
a rapid reaction with both PZP and <*j\, while the reaction between the two 
proteinase inhibitors and trypsin and Staphylococcus aureus Glu-specific 
25 proteinase is quite dissimilar for PZP and o^M: both proteinases react rapidly 
with o^M, while the reaction with PZP is slow (Sand et al.1985). The reason 
for this difference in reaction rate with the different proteinases is 
believed to be due to the fact that the bait region in PZP contains strong 
specificity determinant for chymotrypsin and elastase, but none for trypsin 
™ a „H staphvinrnccus aureus Glu-specific proteinase. 

The results of the analysis is presented in figures 10 to 13. 
Figure 10 illustrates the gel electrophoresis (10 - 20 % reducing 
SDS-PAGE) of the reaction products from chymotrypsin treated human human 
PZP and rajM-PZP. Molecular weight markers (from top to bottom: 180, 120, 92, 
35 60, 43, 26, 14 and 6 kD) were applied to lanes 1 and 8. All samples were 
reduced. Lanes 2, 3 and 4 show the cleavage products obtained from reaction 
of chymotrypsin with human plasma derived PZP, ro^M-PZP and human plasma 
derived ojt, respectively. The ratio of proteinase to inhibitor was 1:1. Lanes 
5, 6 and 7 show cleavage products from similar reactions at a ratio of 2:1 
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between proteinase and the three tested Inhibitors. In all 6 lanes cleavage 
products (85 kD) could be identified. This indicated that ra^-PZP reacted 
with chymotrypsin with similar characteristics as did human plasma derived 
and PZP. 

5 Figure 11 illustrates the gel electrophoresis (10 - 20 % reducing 

SDS-PAGE) of the reaction products from elastase treated human o^M, human 
PZP and rofeM-PZP. Molecular weight markers were the same as applied on the 
gel in Fig. 2. All samples were reduced. Lanes 2, 3 and 4 show the cleavage 
products obtained from reaction of elastase with human plasma derived PZP, 

lOrokM-PZP and human plasma derived c^M, respectively. The ratio of proteinase 
to inhibitor was 1:1. Lanes 5, 6 and 7 show cleavage products from similar 
reactions at a ratio of 2:1 between proteinase and the three tested 
inhibitors. In all 6 lanes cleavage products (85 kD) could be identified. 
This indicated that ra^M-PZP reacted with elastase with similar character! s- 

15 tics as did human plasma derived aJA and PZP. 

Figure 12 illustrates the gel electrophoresis (10 - 20 % reducing 
SDS-PAGE) of the reaction products from trypsin treated human o^, human PZP 
and rofeM-PZP. Molecular weight markers were the same as applied on the gel 
in Fig. 2. All samples were reduced. Lanes 2, 3 and 4 show the cleavage 

20 products obtained from reaction of trypsin with human plasma derived PZP, 
human plasma derived o^M and rofeM-PZP, respectively. The ratio of proteinase 
to inhibitor was 1:1. Lanes 5, 6 and 7 show cleavage products from similar 
reactions at a ratio of 2:1 between proteinase and the three tested 
inhibitors. In lanes 3 and 6 cleavage products (85 kD) could be identified 

25 from the reaction between trypsin and o^M. In lanes 2, 4, 5 and 7 no cleavage 
products were observed from the reaction of trypsin with PZP and rofeM-PZP. 
This result demonstrated that raaM-PZP reacted poorly with trypsin as did 
human plasma derived PZP, while o^M was cleaved in the reaction with trypsin. 

Figure 13 illustrates the gel electrophoresis (10 - 20 % reducing 

30 SDS-PAGE) of the reaction products from Staphylococcus aureus Glu-specific 
protease treated human o^M, human PZP and roijM-PZP. Molecular weight markers 
were the same as applied on the gel in Fig. 2. All samples were reduced. 
Lanes 2, 3 and 4 show the cleavage products obtained from reaction of 
Staphylococcus aureus Glu-specific protease with human plasma derived PZP, 

SSrofeM-PZP and human plasma derived o^, respectively. The ratio of proteinase 
to inhibitor was 1:1. Lanes 5, 6 and 7 show cleavage products from similar 
reactions at a ratio of 2:1 between proteinase and the three tested 
inhibitors. In lanes 4 and 7 cleavage products (85 kD) could be identified 
from the reaction between Staphylococcus aureus Glu-specific protease and 
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OjM. In lanes 2, 3, 5 and 6 much less cleavage product could be identified 
from the reaction of this proteinase with PZP and ro^-PZP. This result 
demonstrated that ro^M-PZP reacted poorly with the Staphylococcus aureus 
proteinase as did human plasma derived PZP, while o^M was cleaved in the 
5 reaction with this proteinase. 

It can be concluded that ro^M-PZP showed the same pattern of 
reaction with four proteinases as did human plasma derived PZP. This pattern 
of reaction was different from the corresponding pattern obtained from 
reaction with OjM. Thus rOgM-PZP has been demonstrated to have a proteinase 
10 inhibitor profile similar to native PZP and dissimilar to ct£. Thus it has 
been demonstrated that the proteinase Inhibitor profile of o^M can be 
modulated by substitution of DNA fragments encoding the bait region. 

The substitution as described in this invention did not destroy 
the activity of the proteinase inhibitor, and it is therefore demonstrated 
15 that functional macroglobulin hybrids can be constructed by substitutions 
(mutations) in the bait region. The finding will lead to the design of a^M- 
derivatives with new desired proteinase specificities. No doubt, these 
results could be extended to other macroglobulin based hybrids, 1n which the 
bait region can be modified at will to obtain new inhibitor specificities. 
20 Aggressive activity of proteinases is often a problem in relation 

to various diseases (e.g. the activity of elastase and cathepsin G in severe 
inflammation leads to tissue and organ destruction and failure). Inhibitors 
of such proteinases will be useful in drug design. In situations where the 
target site for the proteinase is known, but no inhibitor can be identified, 
25^ can be engineered (mutated in the bait region) to obtain the desired 
specificity. In a situation where the target specificity of the proteinase 
in question is unknown, saturation mutagenesis or random synthesis of the 
bait region will lead to an indefinite number of target sequences that can 
be introduced and expressed in hybrid macroglobulins. These hybrids can be 
30 screened for proteinase inhibition, and the target sequence(s) can be 
identified. The resulting otjM analog can be produced and purified as described 
elsewhere in this invention. Upon injection into the circulation such OjM 
analogs will inhibit and clear from the blood any proteinase of the given 
specificity. 

35 Introduction of protein analogs or mutants in the human body 

always raises the possibility for antigenicity. The generation of-a panel 
of 45 mouse monoclonal antibodies against human ojl has been described (Van 
Leuven et al.1988; Delain et al.1988). None of these antibodies were directed 
against the bait region. This indicates that the bait region is not highly 
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antigenic and that mutants in this region of the molecule can be generated 
and used for therapeutical uses without risk for antibody development. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(1) APPLICANT: Novo Nordisk A/S 

(ii) TITLE OF INVENTION: Expression of Plasma Glycoproteins 
(1ii) NUMBER OF SEQUENCES: 4 

(,V) ^"'Sls^ VS, ment Department 

(B) STREET: Novo Alle 

(C) CITY: Bagsvaerd 

(E) COUNTRY: DENMARK 

(F) ZIP: DK-2880 

(vii) PRIOR APPLICATION DATA: nv &r ,c /M nv 4237/89 

(A) APPLICATION NUMBER: DK 4235/89, DK 4236/89, DK 4<U//t« 

(B) FILING DATE: 29-AUG-1989 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4569 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: N 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(F) TISSUE TYPE: Hepatic 

(G) CELL TYPE: Hepatoblastoma 

(H) CELL LINE: HepG2 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 29.. 4450 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 
GTCTCCTCCA GCTCCTTCTT TCTGCAAC ATG GGG AAG AAC AAA CTC CTT CAT 

1 5 

CCA AGT CTG GTT CTT CTC CTC TTG GTC CTC CTG CCC ACA GAC GCC TCA 
So Ser Leu Val Leu Leu Leu Leu Val Leu Leu Pro Thr Asp Ala Ser 
10 15 . 2° 
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GTC TCT GGA AAA CCG CAG TAT ATG GTT CTG GTC CCC TCC CTG CTC CAC 148 

Val Ser Gly Lys Pro Gin Tyr Met Val Leu Val Pro Ser Leu Leu His 
25 30 35 40 

ACT GAG ACC ACT GAG AAG GGC TGT GTC CTT CTG AGC TAC CTG AAT GAG 196 
Thr Glu Thr Thr Glu Lys Gly Cys Val Leu Leu Ser Tyr Leu Asn Glu 
45 50 55 

ACA GTG ACT GTA AGT GCT TCC TTG GAG TCT GTC AGG GGA AAC AGG AGC 244 
Thr Val Thr Val Ser Ala Ser Leu Glu Ser Val Arg Gly Asn Arg Ser 
60 65 70 

CTC TTC ACT GAC CTG GAG GCG GAG AAT GAC GTA CTC CAC TGT GTC GCC 292 
Leu Phe Thr Asp Leu Glu Ala Glu Asn Asp Val Leu His Cys Val Ala 
75 80 85 

TTC GCT GTC CCA AAG TCT TCA TCC AAT GAG GAG GTA ATG TTC CTC ACT 340 
Phe Ala Val Pro Lys Ser Ser Ser Asn Glu Glu Val Met Phe Leu Thr 
90 95 100 

GTC CAA GTG AAA GGA CCA ACC CAA GAA TTT AAG AAG CGG ACC ACA GTG 388 
Val Gin Val Lys Gly Pro Thr Gin Glu Phe Lys Lys Arg Thr Thr Val 
105 110 115 120 

ATG GTT AAG AAC GAG GAC AGT CTG GTC TTT GTC CAG ACA GAC AAA TCA 436 
Met Val Lys Asn Glu Asp Ser Leu Val Phe Val Gin Thr Asp Lys Ser 
125 130 135 

ATC TAC AAA CCA GGG CAG ACA GTG AAA TTT CGT GTT GTC TCC ATG GAT 484 
He Tyr Lys Pro Gly Gin Thr Val Lys Phe Arg Val Val Ser Met Asp 
140 145 " 150 

GAA AAC TTT CAC CCC CTG AAT GAG TTG ATT CCA CTA GTA TAC ATT CAG 532 
Glu Asn Phe His Pro Leu Asn Glu Leu He Pro Leu Val Tyr lie Gin 
155 160 165 

GAT CCC AAA GGA AAT CGC ATC GCA CAA TGG CAG AGT TTC CAG TTA GAG 580 
Asp Pro Lys Gly Asn Arg He Ala Gin Trp Gin Ser Phe Gin Leu Glu 
170 175 180 

GGT GGC CTC AAG CAA TTT TCT TTT CCC CTC TCA TCA GAG CCC TTC CAG 628 
Gly Gly Leu Lys Gin Phe Ser Phe Pro Leu Ser Ser Glu Pro Phe Gin 
185 190 195 200 

GGC TCC TAC AAG GTG GTG GTA CAG AAG AAA TCA GGT GGA AGG ACA GAG 676 
Gly Ser Tyr Lys Val Val Val Gin Lys Lys Ser Gly Gly Arg Thr Glu 
205 210 215 

CAC CCT TTC ACC GTG GAG GAA TTT GTT CTT CCC AAG TTT GAA GTA CAA 724 
His Pro Phe Thr Val Glu Glu Phe Val Leu Pro Lys Phe Glu Val Gin 
220 225 230 

GTA ACA GTG CCA AAG ATA ATC ACC ATC TTG GAA GAA GAG ATG AAT GTA 772 
Val Thr Val Pro Lys He He Thr He Leu Glu Glu Glu Met Asn Val 
235 240 245 
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TCA GTG TGT GGC CTA TAC ACA TAT GGG AAG CCT GTC CCT GGA CAT GTG 
Ser Val Cys Gly Leu Tyr Thr Tyr Gly Lys Pro Val Pro Gly His Val 
250 255 260 

ACT GTG AGC ATT TGC AGA AAG TAT AGT GAC GCT TCC GAC TGC CAC GGT 
Thr Val Ser He Cys Arg Lys Tyr Ser Asp Ala Ser Asp Cys His Gly 
265 270 275 280 

GAA GAT TCA CAG GCT TTC TGT GAG AAA TTC AGT GGA CAG CTA AAC AGC 
Glu Asp Ser Gin Ala Phe Cys Glu Lys Phe Ser Gly Gin Leu Asn Ser 
285 290 295 

CAT GGC TGC TTC TAT CAG CAA GTA AAA ACC AAG GTC TTC CAG CTG AAG 
His Gly Cys Phe Tyr Gin Gin Val Lys Thr Lys Val Phe Gin Leu Lys 
300 305 310 

AGG AAG GAG TAT GAA ATG AAA CTT CAC ACT GAG GCC CAG ATC CAA GAA 
Arg Lys Glu Tyr Glu Met Lys Leu His Thr Glu Ala Gin He Gin Glu 
315 320 325 

GAA GGA ACA GTG GTG GAA TTG ACT GGA AGG CAG TCC AGT GAA ATC ACA 
Glu Gly Thr Val Val Glu Leu Thr Gly Arg Gin Ser Ser Glu He Thr 
330 335 340 

AGA ACC ATA ACC AAA CTC TCA TTT GTG AAA GTG GAC TCA CAC TTT CGA 
Arg Thr He Thr Lys Leu Ser Phe Val Lys Val Asp Ser His Phe Arg 
345 350 355 360 

CAG GGA ATT CCC TTC TTT GGG CAG GTG CGC CTA GTA GAT GGG AAA GGC 
Gin Gly He Pro Phe Phe Gly Gin Val Arg Leu Val Asp Gly Lys Gly 
365 370 375 

GTC CCT ATA CCA AAT AAA GTC ATA TTC ATC AGA GGA AAT GAA GCA AAC 
Val Pro He Pro Asn Lys Val He Phe He Arg Gly Asn Glu Ala Asn 
380 385 390 

TAT TAC TCC AAT GCT ACC ACG GAT GAG CAT GGC CTT GTA CAG TTC TCT 
Tyr Tyr Ser Asn Ala Thr Thr Asp Glu His Gly Leu Val Gin Phe Ser 
395 400 405 

ATC AAC ACC ACC AAT GTT ATG GGT ACC TCT CTT ACT GTT AGG GTC AAT 
He Asn Thr Thr Asn Val Met Gly Thr Ser Leu Thr Val Arg Val Asn 
410 415 420 

TAC AAG GAT CGT AGT CCC TGT TAC GGC TAC CAG TGG GTG TCA GAA GAA 
Tyr Lys Asp Arg Ser Pro Cys Tyr Gly Tyr Gin Trp Val Ser Glu Glu 
425 430 435 440 

CAC GAA GAG GCA CAT CAC ACT GCT TAT CTT GTG TTC TCC CCA AGC AAG 
His Glu Glu Ala His His Thr Ala Tyr Leu Val Phe Ser Pro Ser Lys 
445 450 455 

AGC TTT GTC CAC CTT GAG CCC ATG TCT CAT GAA CTA CCC TGT GGC CAT 
Ser Phe Val His Leu Glu Pro Met Ser His Glu Leu Pro Cys Gly His 
460 465 470 
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ACT CAG ACA 6TC CAG GCA CAT TAT ATT CTG AAT GGA GGC ACC CTG CTG 1492 
Thr Gin Thr Val Gin Ala His Tyr He Leu Asn Gly Gly Thr Leu Leu 
475 480 485 

GGG CTG AAG AAG CTC TCC TTC TAT TAT CTG ATA ATG GCA AAG GGA GGC 1540 
Gly Leu Lys Lys Leu Ser Phe Tyr Tyr Leu He Met Ala Lys Gly Gly 
490 495 500 

ATT GTC CGA ACT GGG ACT CAT GGA CTG CTT GTG AAG CAG GAA GAl ATG 1588 
He Val Arg Thr Gly Thr His Gly Leu Leu Val Lys Gin Glu Asp Met 
505 510 515 520 

AAG GGC CAT TTT TCC ATC TCA ATC CCT GTG AAG TCA GAC ATT GCT CCT 1636 
Lys Gly His Phe Ser He Ser He Pro Val Lys Ser Asp He Ala Pro 
525 530 535 

GTC GCT CGG TTG CTC ATC TAT GCT GTT TTA CCT ACC GGG GAC GTG ATT 1684 
Val Ala Arg Leu Leu He Tyr Ala Val Leu Pro Thr Gly Asp Val He 
540 545 550 

GGG GAT TCT GCA AAA TAT GAT GTT GAA AAT TGT CTG GCC AAC AAG GTG 1732 
Gly Asp Ser Ala Lys Tyr Asp Val Glu Asn Cys Leu Ala Asn Lys Val 
555 560 565 

GAT TTG AGC TTC AGC CCA TCA CAA AGT CTC CCA GCC TCA CAC GCC CAC 1780 
Asp Leu Ser Phe Ser Pro Ser Gin Ser Leu Pro Ala Ser His Ala His 
570 575 580 

CTG CGA GTC ACA GCG GCT CCT CAG TCC GTC TGC GCC CTC CGT GCT GTG 1828 
Leu Arg Val Thr Ala Ala Pro Gin Ser Val Cys Ala Leu Arg Ala Val 
585 590 595 600 

GAC CAA AGC GTG CTG CTC ATG AAG CCT GAT GCT GAG CTC TCG GCG TCC 1876 
Asp Gin Ser Val Leu Leu Met Lys Pro Asp Ala Glu Leu Ser Ala Ser 
605 610 615 

TCG GTT TAC CTG CTA CCA GAA AAG GAC CTC ACT GGC TTC CCT GGG 1924 
Ser Val Tyr Ai.i Leu Leu Pro Glu Lys Asp Leu Thr Gly Phe Pro Gly 
620 625 630 

CCT TTG AAT GAC CAG GAC GAT GAA GAC TGC ATC AAT CGT CAT AAT GTC 1972 
Pro Leu Asn Asp Gin Asp Asp Glu Asp Cys He Asn Arg His Asn Val 
635 640 645 

TAT ATT AAT GGA ATC ACA TAT ACT CCA GTA TCA AGT ACA AAT GAA AAG 2020 
Tyr He Asn Gly He Thr Tyr Thr Pro Val Ser Ser Thr Asn Glu Lys 
650 655 660 

GAT ATG TAC AGC TTC CTA GAG GAC ATG GGC TTA AAG GCA TTC ACC AAC 2068 
Asp Met Tyr Ser Phe Leu Glu Asp Met Gly Leu Lys Ala Phe Thr Asn 
665 670 675 680 

TCA AAG ATT CGT AAA CCC AAA ATG TGT CCA CAG CTT CAA CAG TAT GAA 2116 
Ser Lys He Arg Lys Pro Lys Met Cys Pro Gin Leu Gin Gin Tyr Glu 
685 690 695 
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TGT CCA TCA GGT GGT GAG GTT TCT GAA GAA TTA TCC CTG AAA CTG CCA 2836 
Cvs Pro Ser Gly Gly Glu Val Ser Glu Glu Leu Ser Leu Lys Leu Pro 
925 930 935 

CCA AAT GTG GTA GAA GAA TCT GCC CGA GCT TCT GTC TCA GTT TTG GGA 2884 
Pro Asn Val Val Glu Glu Ser Ala Arg Ala Ser Val Ser Val Leu Gly 
940 945 950 

GAC ATA TTA GGC TCT GCC ATG CAA AAC ACA CAA AAT CTT CTC CAG ATG 2932 
Asp He Leu Gly Ser Ala Met Gin Asn Thr Gin Asn Leu Leu Gin Met 
955 960 965 

CCC TAT GGC TGT GGA GAG CAG AAT ATG GTC CTC TTT GCT CCT AAC ATC 2980 
Pro Tyr Gly Cys Gly Glu Gin Asn Met Val Leu Phe Ala Pro Asn He 
970 ~ 975 980 

TAT GTA CTG GAT TAT CTA AAT GAA ACA CAG CAG CTT ACT CCA GAG ATC 3028 
Tyr Val Leu Asp Tyr Leu Asn Glu Thr Gin Gin Leu Thr Pro Glu He 
985 990 995 1000 

AAG TCC AAG GCC ATT GGC TAT CTC AAC ACT GGT TAC CAG AGA CAG TTG 3076 
Lys Ser Lys Ala He Gly Tyr Leu Asn Thr Gly Tyr Gin Arg Gin Leu 
1005 1010 1015 

AAC TAC AAA CAC TAT GAT GGC TCC TAC AGC ACC TTT GGG GAG CGA TAT 3124 
Asn Tyr Lys His Tyr Asp Gly Ser Tyr Ser Thr Phe Gly Glu Arg Tyr 
1020 1025 1030 

GGC AGG AAC CAG GGC AAC ACC TGG CTC ACA GCC TTT GTT CTG AAG ACT 3172 
Gly Arg Asn Gin Gly Asn Thr Trp Leu Thr Ala Phe Val Leu Lys Thr 
1035 1040 1045 

TTT GCC CAA GCT CGA GCC TAC ATC TTC ATC GAT GAA GCA CAC ATT ACC 3220 
Phe Ala Gin Ala Arg Ala Tyr He Phe He Asp Glu Ala His He Thr 
1050 1055 1060 

CAA GCC CTC ATA TGG CTC TCC CAG AGG CAG AAG GAC AAT GGC TGT TTC 3268 
Gin Ala Leu He Trp Leu Ser Gin Arg Gin Lys Asp Asn Gly Cys Phe 
1065 1070 1075 1080 

AGG AGC TCT GGG TCA CTG CTC AAC AAT GCC ATA AAG GGA GGA GTA GAA 3316 
Arg Ser Ser Gly Ser Leu Leu Asn Asn Ala He Lys Gly Gly Val Glu 
1085 1090 1095 

GAT GAA GTG ACC CTC TCC GCC TAT ATC ACC ATC GCC CTT CTG GAG ATT 3364 
Asp Glu Val Thr Leu Ser Ala Tyr He Thr He Ala Leu Leu Glu He 
1100 1105 1110 

CCT CTC ACA GTC ACT CAC CCT GTT GTC CGC AAT GCC CTG TTT TGC CTG 3412 
Pro Leu Thr Val Thr His Pro Val Val Arg Asn Ala Leu Phe Cys Leu 
1115 1120 1125 

GAG TCA GCC TGG AAG ACA GCA CAA GAA GGG GAC CAT GGC AGC CAT GTA 3460 
Glu Ser Ala Trp Lys Thr Ala Gin Glu Gly Asp His Gly Ser His Val 
1130 1135 1140 
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TAT ACC AAA GCA CT6 CTG GCC TAT GCT TTT GCC CTG GCA GGT AAC CAG 
Tyr Thr Lys Ala Leu Leu Ala Tyr Ala Phe Ala Leu Ala Gly Asn Gin 
1145 1150 H55- "60 

GAC AAG AGG AAG GAA GTA CTC AAG TCA CTT AAT GAG GAA GCT GTG AAG 
Asp Lys Arg Lys Glu^Val Leu Lys Ser Leu^Asn Glu Glu Ala Val^Lys 

AAA GAC AAC TCT GTC CAT TGG GAG CGC CCT CAG AAA CCC AAG GCA CCA 
Lys Asp Asn Ser Val His Trp Glu Arg Pro Gin Lys Pro Lys Ala Pro 
3 V H80 H85 H90 

GTG GGG CAT TTT TAC GAA CCC CAG GCT CCC TCT GCT GAG GTG GAG ATG 
Val Gly His Phe Tyr Glu Pro Gin Ala Pro Ser Ala Glu Val Glu Met 
1195 1200 1205 

ACA TCC TAT GTG CTC CTC GCT TAT CTC ACG GCC CAG CCA GCC CCA ACC 
Thr Ser Tyr Val Leu Leu Ala Tyr Leu Thr Ala Gin Pro Ala Pro Thr 
1210 1215 1220 

TCG GAG GAC CTG ACC TCT GCA ACC AAC ATC GTG AAG TGG ATC ACG AAG 
Ser Glu Asp Leu Thr Ser Ala Thr Asn He Val Lys Trp He Thr Lys 
1225 1230 1235 1240 

CAG CAG AAT GCC CAG GGC GGT TTC TCC TCC ACC CAG CAC ACA GTG GTG 
Gin Gin Asn Ala Gin Gly Gly Phe Ser Ser Thr Gin His Thr Val Val 
1245 1250 1255 

GCT CTC CAT GCT CTG TCC AAA TAT GGA GCA GCC ACA TTT ACC AGG ACT 
Ala Leu His Ala Leu Ser Lys Tyr Gly Ala Ala Thr Phe Thr Arg Thr 
1260 1265 1270 

GGG AAG GCT GCA CAG GTG ACT ATC CAG TCT TCA GGG ACA TTT TCC AGC 
Gly Lys Ala Ala Gin Val Thr He Gin Ser Ser Gly Thr Phe Ser Ser 
1275 1280 1285 

AAA TTC CAA GTG GAC AAC AAC AAC CGC CTG TTA CTG CAG CAG GTC TCA 
Lys Phe Gin Val Asp Asn Asn Asn Arg Leu Leu Leu Gin Gin Val Ser 
1290 1295 1300 

TTG CCA GAG CTG CCT GGG GAA TAC AGC ATG AAA GTG ACA GGA GAA GGA 
Leu Pro Glu Leu Pro Gly Glu Tyr Ser Met Lys Val Thr Gly Glu Gly 
1305 1310 1315 1320 

TGT GTC TAC CTC CAG ACA TCC TTG AAA TAC AAT ATT CTC CCA GAA AAG 
Cys Val Tyr Leu Gin Thr Ser Leu Lys Tyr Asn He Leu Pro Glu Lys 
1325 1330 1335 

GAA GAG TTC CCC TTT GCT TTA GGA GTG CAG ACT CTG CCT CAA ACT TGT 
Glu Glu Phe Pro Phe Ala Leu Gly Val Gin Thr Leu Pro Gin Thr Cys 
1340 1345 1350 

GAT GAA CCC AAA GCC CAC ACC AGC TTC CAA ATC TCC CTA AGT GTC AGT 
Asp Glu Pro Lys Ala His Thr Ser Phe Gin He Ser Leu Ser Val Ser 
1355 1360 1365 
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TAC ACA GGG AGC CGC TCT GCC TCC AAC ATG GCG ATC GTT GAT GTG AAG 4180 
Tyr Thr Gly Ser Arg Ser Ala Ser Asn Met Ala He Val Asp Val Lys 
1370 ' 1375 1380 

ATG GTC TCT GGC TTC ATT CCC CTG AAG CCA ACA GTG AAA ATG CTT GAA 4228 
Met /al Ser Gly Phe He Pro Leu Lys Pro Thr Val Lys Met Leu Glu 
1385 1390 1395 1400 

AGA TCT AAC CAT GTG AGC CGG ACA GAA GTC AGC AGC AAC CAT GTC TTG 4275 
Arg Ser Asn His Val Ser Arg Thr Glu Val Ser Ser Asn His Val Leu 
1405 1410 1415 

ATT TAC CTT GAT AAG GTG TCA AAT CAG ACA CTG AGC TTG TTC TTC ACG 4324 
He Tyr Leu Asp Lys Val Ser Asn Gin Thr Leu Ser Leu Phe Phe Thr 
1420 1425 1430 

GTT CTG CAA GAT GTC CCA GTA AGA GAT CTC AAA CCA GCC ATA GTG AAA 4372 
Val Leu Gin Asp Val Pro Val Arg Asp Leu Lys Pro Ala He Val Lys 
1435 1440 1445 

GTC TAT GAT TAC TAC GAG ACG GAT GAG TTT GCA ATT GCT GAG TAC AAT 4420 
Val Tyr Asp Tyr Tyr Glu Thr Asp Glu Phe Ala He Ala Glu Tyr Asn 
1450 1455 1460 

GCT CCT TGC AGC AAA GAT CTT GGA AAT GCT TGAAGACCAC AAGGCTGAAA 4470 
Ala Pro Cys Ser Lys Asp Leu Gly Asn Ala 
1465 1470 

AGTGCTTTGC TGGAGTCCTG TTCTCTGAGC TCCACAGAAG ACACGTGTTT TTGTATCTTT 4530 
AAAGACTTGA TGAATAAACA CTTTTTCTGG TCAAAAAAA 4569 

(2) INFORMATION FOR SEQ ID N0:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1474 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(E) FEATURES: bait region: 690-730 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

Met Gly Lys Asn Lys Leu Leu His Pro Ser Leu Val Leu Leu Leu Leu 
15 10 15 

Val Leu Leu Pro Thr Asp Ala Ser Val Ser Gly Lys Pro Gin Tyr Met 
20 25 30 

Val Leu Val Pro Ser Leu Leu His Thr Glu Thr Thr Glu Lys Gly Cys 
35 40 45 

Val Leu Leu Ser Tyr Leu Asn Glu Thr Val Thr Val Ser Ala Ser Leu 
50 55 60 

Glu Ser Val Arg Gly Asn Arg Ser Leu Phe Thr Asp Leu Glu Ala Glu 
65 70 75 80 



REPLACEMENT SHEET 



WO 91/03557 PCT/DK90/00225 

42 

Asn Asp Val Leu His Cys Val Ala Phe Ala Val Pro Lys Ser Ser Ser 
85 90 35 

Asn Glu Glu Val Met Phe Leu Thr Val Gin Val Lys Gly Pro Thr Gin 
100 1° 5 110 

Glu Phe Lys Lys Arg Thr Thr Val Met Val Lys Asn Glu Asp Ser Leu 
115 * 120 125 

Val Phe Val Gin Thr Asp Lys Ser He Tyr Lys Pro Gly Gin Thr Val 
130 135 I 40 

Lys Phe Arg Val Val Ser Met Asp Glu Asn Phe His Pro Leu Asn Glu 
145 150 155 * 

Leu He Pro Leu Val Tyr He Gin Asp Pro Lys Gly Asn Arg lie Ala 
165 170 175 

Gin Trp Gin Ser Phe Gin Leu Glu Gly Gly Leu Lys Gin Phe Ser Phe 
180 185 190 

Pro Leu Ser Ser Glu Pro Phe Gin Gly Ser Tyr Lys Val Val Val Gin 
195 200 205 

Lys Lys Ser Gly Gly Arg Thr Glu His Pro Phe Thr Val Glu Glii Phe 
210 215 220 

Val Leu Pro Lys Phe Glu Val Gin Val Thr Val Pro Lys He He Thr 
225 230 235 240 

He Leu Glu Glu Glu Met Asn Val Ser Val Cys Gly Leu Tyr Thr Tyr 
245 250 255 

Gly Lys Pro Val Pro Gly His Val Thr Val Ser He Cys Arg Lys Tyr 
260 265 270 

Ser Asp Ala Ser Asp Cys His Gly Glu Asp Ser Gin Ala Phe Cys Glu 
275 280 285 

Lvs Phe Ser Gly Gin Leu Asn Ser His Gly Cys Phe Tyr Gin Gin Val 
3 290 295 300 

Lys Thr Lys Val Phe Gin Leu Lys Arg Lys Glu Tyr Glu Met Lys Leu 
305 310 315 320 

His Thr Glu Ala Gin He Gin Glu Glu Gly Thr Val Val Glu Leu Thr 
325 330 335 

Gly Arg Gin Ser Ser Glu He Thr Arg Thr He Thr Lys Leu Ser Phe 
340 345 350 

Val Lys Val Asp Ser His Phe Arg Gin Gly He Pro Phe Phe Gly Gin 
355 360 365 

Val Arg Leu Val Asp Gly Lys Gly Val Pro He Pro Asn Lys Val He 
370 375 380 
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Phe He Arg Gly Asn Glu Ala Asn Tyr Tyr Ser Asn Ala Thr Thr Asp 
385 390 395 400 

Glu His Gly Leu Val Gin Phe Ser He Asn Thr Thr Asn Val Met Gly 
405 410 415 

Thr Ser Leu Thr Val Arg Val Asn Tyr Lys Asp Arg Ser Pro Cys Tyr 
420 425 430 

Gly Tyr Gin Trp Val Ser Glu Glu His Glu Glu Ala His His Thr Ala 
435 440 445 

Tyr Leu Val Phe Ser Pro Ser Lys Ser Phe Val His Leu Glu Pro Met 
450 455 460 

Ser His Glu Leu Pro Cys Gly His Thr Gin Thr Val Gin Ala His Tyr 
465 470 475 480 

He Leu Asn Gly Gly Thr Leu Leu Gly Leu Lys Lys Leu Ser Phe Tyr 
485 490 495 

Tyr Leu He Met Ala Lys Gly Gly He Val Arg Thr Gly Thr His Gly 
500 505 510 

Leu Leu Val Lys Gin Glu Asp Met Lys Gly His Phe Ser He Ser He 
515 520 525 

Pro Val Lys Ser Asp He Ala Pro Val Ala Arg Leu Leu He Tyr Ala 
530 535 540 

Val Leu Pro Thr Gly Asp Val He Gly Asp Ser Ala Lys Tyr Asp Val 
545 550 555 560 

Glu Asn Cys Leu Ala Asn Lys Val Asp Leu Ser Phe Ser Pro Ser Gin 
565 570 575 

Ser Leu Pro Ala Ser His Ala His Leu Arg Val Thr Ala Ala Pro Gin 
580 585 590 

Ser Val Cys Ala Leu Arg Ala Val Asp Gin Ser Val Leu Leu Met Lys 
595 600 605 

Pro Asp Ala Glu Leu Ser Ala Ser Ser Val Tyr Asn Leu Leu Pro Glu 
610 615 620 

Lys Asp Leu Thr Gly Phe Pro Gly Pro Leu Asn Asp Gin Asp Asp Glu 
. 625 630 635 640 

Asp Cys He Asn Arg His Asn Val Tyr He Asn Gly He Thr Tyr Thr • 
645 650 655 

Pro Val Ser Ser Thr Asn Glu Lys Asp Met Tyr Ser Phe Leu Glu Asp 
660 665 670 

Met Gly Leu Lys Ala Phe Thr Asn Ser Lys lie Arg Lys Pro Lys Met 
675 680 685 
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Cys Pro Gin Leu Gin Gin Tyr Glu Met His Gly Pro Glu Gly Leu Arg 

690 695 700 

Val Gly Phe Tyr Glu Ser Asp Val Met Gly Arg Gly His Ala Arg Leu 
705 710 m 

Val His Val Glu Glu Pro His Thr Glu Thr Val Arg Lys Tyr Phe Pro 
725 '30 

Glu Thr Trp He Trp Asp Leu Val Val Val Asn Ser Ala Gly Val Ala 
740 745 

Glu Val Gly Val Thr Val Pro Asp Thr He Thr Glu Trp Lys Ala Gly 
755 760 1 03 

Ala Phe Cys Leu Ser Glu Asp Ala Gly Leu Gly lie Ser Ser Thr Ala 
770 775 780 

Ser Leu Arg Ala Phe Gin Pro Phe Phe Val Glu Leu Thr Met Pro Tyr 
785 790 795 

Ser Val He Arg Gly Glu Ala Phe Thr Leu Lys Ala Thr Val Leu Asn 
805 810 

Tyr Leu Pro Lys Cys He Arg Val Ser Val Gin Leu Glu Ala Ser Pro 
820 825 

Ala Phe Leu Ala Val Pro Val Glu Lys Glu Gin Ala Pro His Cys He 
835 840 

Cys Ala Asn Gly Arg Gin Thr Val Ser Trp Ala Val Thr Pro Lys Ser 
850 355 860 

Leu Gly Asn Val Asn Phe Thr Val Ser Ala Glu Ala Leu Glu Ser Gin 
865 370 875 

Glu Leu Cys Gly Thr Glu Val Pro Ser Val Pro Glu His Gly Arg Lys 
885 890 

Asp Thr Val He Lys Pro Leu Leu Val Glu Pro Glu Gly Leu Glu Lys 
900 905 

Glu Thr Thr Phe Asn Ser Leu Leu Cys Pro Ser Gly Gly Glu Val Ser 
915 92° 9Z5 

Glu Glu Leu Ser Leu Lys Leu Pro Pro Asn Val Val Glu Glu Ser Ala 
930 935 940 

Arg Ala Ser Val Ser Val Leu Gly Asp He Leu Gly Ser Ala Met Gin 
945 950 955 

Asn Thr Gin Asn Leu Leu Gin Met Pro Tyr Gly Cys Gly Glu Gin Asn 
965 970 

Met Val Leu Phe Ala Pro Asn He Tyr Val Leu Asp Tyr Leu Asn Glu 
980 985 5,3 
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Thr Sin Gin Leu Thr Pro Glu lie Lys Ser Lys Ala He Gly Tyr Leu 
995 1000 1005 

Asn Thr Gly Tyr Gin Arg Gin Leu Asn Tyr Lys His Tyr Asp Gly Ser 
1010 1015 1020 

Tyr Ser Thr Phe Gly Glu Arg Tyr Gly Arg Asn Gin Gly Asn Thr Trp 
1025 1030 1035 1040 

Leu Thr Ala Phe Val Leu Lys Thr Phe Ala Gin Ala Arg Ala Tyr He 
1045 1050 1055 

Phe He Asp Glu Ala His He Thr Gin Ala Leu He Trp Leu Ser Gin 
1060 1065 1070 

Arg Gin Lys Asp Asn Gly Cys Phe Arg Ser Ser Gly Ser Leu Leu Asn 
1075 1080 1085 

Asn Ala He Lys Gly Gly Val Glu Asp Glu Val Thr Leu Ser Ala Tyr 
1090 1095 HOO 

He Thr He Ala Leu Leu Glu He Pro Leu Thr Val Thr His Pro Val 
1105 1110 1115 1120 

Val Arg Asn Ala Leu Phe Cys Leu Glu Ser Ala Trp Lys Thr Ala Gin 
1125 1130 1135 

Glu Gly Asp His Gly Ser His Val Tyr Thr Lys Ala Leu Leu Ala Tyr 
1140 1145 1150 

Ala Phe Ala Leu Ala Gly Asn Gin Asp Lys Arg Lys Glu Val Leu Lys 
1155 1160 1165 

Ser Leu Asn Glu Glu Ala Val Lys Lys Asp Asn Ser Val His Trp Glu 
1170 1175 H80 

Arg Pro Gin Lys Pro Lys Ala Pro Val Gly His Phe Tyr Glu Pro Gin 
1185 1190 H95 1200 

Ala Pro Ser Ala Glu Val Glu Met Thr Ser Tyr Val Leu Leu Ala Tyr 
1205 1210 1215 

Leu Thr Ala Gin Pro Ala Pro Thr Ser Glu Asp Leu Thr Ser Ala Thr 
1220 1225 1230 

Asn He Val Lys Trp He Thr Lys Gin Gin Asn Ala Gin Gly Gly Phe 
1235 1240 1245 

Ser Ser Thr Gin His Thr Val Val Ala Leu His Ala Leu Ser Lys Tyr 
1250 1255 1260 

Gly Ala Ala Thr Phe Thr Arg Thr Gly Lys Ala Ala Gin Val Thr lie 
1265 1270 1275 1280 

Gin Ser Ser Gly Thr Phe Ser Ser Lys Phe Gin Val Asp Asn Asn Asn 
1285 1290 1295 
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Arg Leu Leu Leu Gin Gin Val Ser Leu Pro Glu Leu Pro GlyGlu Tyr 



1300 



Ser Met Lys Val Thr Gly Glu ffljOfiVil Tyr Leu Gin Thr Ser Leu 

1315 1320 
Lys Tyr Asn He Leu Pro Glu Lys Glu Glu Phe Pro Phe Ala Leu Gly 

1330 1335 
Val Gin Thr Leu Pro Gin Thr Cys Asp Glu Pro Lys Ala His Thr Se^ 
1345 1350 

Phe Gin lie Ser Leu Ser Val Ser Tyr Thr Gly Ser Arg Ser Ala Ser 
1365 1370 

Asn Het Ala He Val Asp Val Lys Met Val Ser Gly Phe lie Pro Leu 
1380 I 385 

Lys Pro Thr Val Lys Met Leu Glu Arg Ser Asn His Val Ser Arg Thr 
1395 H00 

Glu Val Ser Ser Asn His Val Leu He Tyr Leu Asp Lys Val Ser Asn 
1410 1415 

Gin Thr Leu Ser Leu Phe Phe Thr Val Leu Gin Asp Val Pro Val Arg^ 
142 5 1430 

Asp Leu Lys Pro Ala He Val Lys Val Tyr Asp Tyr Tyr Glu Thr Asp 

r 1445 1450 A ^ JJ 

Glu Phe Ala lie Ala Glu Tyr Asn Ala Pro Cys Ser Lys Asp Leu Gly 
1460 1465 

Asn Ala 

(2) INFORMATION FOR SEQ ID N0:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4599 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: Y 

(iv) ANTI-SENSE: N 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME/KEY: COS 

(B) LOCATION: 29.. 4480 
(0) OTHER INFORMATION: 
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(1x) FEATURE: 

(A) NAME/KEY: insertion_seq 

(B) LOCATION: 2102.. 2305 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 

6TCTCCTCCA GCTCCTTCTT TCTGCAAC ATG GGG AAG AAC AAA CTC CTT CAT 52 

Met Gly Lys Asn Lys Leu Leu His 
1 5 

CCA AGT CTG GTT CTT CTC CTC TTG GTC CTC CTG CCC ACA GAC GCC TCA 100 
Pro Ser Leu Val Leu Leu Leu Leu VaT Leu Leu Pro Thr Asp Ala Ser 
10 15 20 

GTC TCT GGA AAA CCG CAG TAT ATG GTT CTG GTC CCC TCC CTG CTC CAC 148 
Val Ser Gly Lys Pro Gin Tyr Met Val Leu Val Pro Ser Leu Leu His 
25 " 30 35 40 

ACT GAG ACC ACT GAG AAG GGC TGT GTC CTT CTG AGC TAC CTG AAT GAG 196 
Thr Glu Thr Thr Glu Lys Gly Cys Val Leu Leu Ser Tyr Leu Asn Glu 
45 50 55 

ACA GTG ACT GTA AGT GCT TCC TTG GAG TCT GTC AGG GGA AAC AGG AGC 244 
Thr Val Thr Val Ser Ala Ser Leu Glu Ser Val Arg Gly Asn Arg Ser 
60 65 70 

CTC TTC ACT GAC CTG GAG GCG GAG AAT GAC GTA CTC CAC TGT GTC GCC 292 
Leu Phe Thr Asp Leu Glu Ala Glu Asn Asp Val Leu His Cys Val Ala 
75 80 85 

TTC GCT GTC CCA AAG TCT TCA TCC AAT GAG GAG GTA ATG TTC CTC ACT 340 
Phe Ala Val Pro Lys Ser Ser Ser Asn Glu Glu Val Met Phe Leu Thr 
90 95 100 

GTC CAA GTG AAA GGA CCA ACC CAA GAA TTT AAG AAG CGG ACC ACA GTG 388 
Val Gin Val Lys Gly Pro Thr Gin Glu Phe Lys Lys Arg Thr Thr Val 
105 " 110 115 120 

ATG GTT AAG AAC GAG GAC AGT CTG GTC TTT GTC CAG ACA GAC AAA TCA 436 
Met Val Lys Asn Glu Asp Ser Leu Val Phe Val Gin Thr Asp Lys Ser 
125 130 135 

ATC TAC AAA CCA GGG CAG ACA GTG AAA TTT CGT GTT GTC TCC ATG GAT 484 
He Tyr Lys Pro Gly Gin Thr Val Lys Phe Arg Val Val Ser Met Asp 
140 145 150 

GAA AAC TTT CAC CCC CTG AAT GAG TTG ATT CCA CTA GTA TAC ATT CAG 532 
Glu Asn Phe His Pro Leu Asn Glu Leu He Pro Leu Val Tyr He Gin 
155 160 165 

GAT CCC AAA GGA AAT CGC ATC GCA CAA TGG CAG AGT TTC CAG TTA GAG 580 
Asp Pro Lys Gly Asn Arg He Ala Gin Trp Gin Ser Phe Gin Leu Glu 
170 * 175 180 
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235 



628 



rrr PfiC CTC AAG CAA TTT TCT TTT CCC CTC TCA TCA GAG CCC TTC CAG 
G?J G?y lelZs iTn Phi Ser Phe Pro Leu Ser Ser Glu Pro Phe Gin 
185 190 195 

GGC TCC TAC AAG GTG GTG GTA CAG AAG AAA TCA GGT GGA AGG ACA GAG 676 
Gly Ser Tyr Lys Val Val Val Gin Lys Lys Ser Gly Gly Arg Thr Glu 
205 210 tia 

rar rrT TTC ACC GTG GAG GAA TTT GTT CTT CCC AAG TTT GAA GTA CAA 724 
His Pro Phe Thr 5a? GlS SlS Phe Val Leu Pro Lys Phe Glu Val Gin 
220 225 230 

GTA ACA GTG CCA AAG ATA ATC ACC ATC HG GAA GAA GAG ATG AAT GTA 772 
Val Thr Val Pro Lys He He Thr He Leu Glu Glu Glu Met Asn Val 
— 240 245 



TCA GTG TGT GGC CTA TAC ACA TAT GGG AAG CCT GTC CCT GGA CAT GTG 820 
Ser Val Cys Gly Leu Tyr Thr Tyr Gly Lys Pro Val Pro Gly His Val 
250 255 260 

ACT GTG AGC ATT TGC AGA AAG TAT AGT GAC GCT TCC GAC TGC CAC GGT 868 
Thr Val Ser He Cys Arg Lys Tyr Ser Asp Ala Ser Asp Cys His Gly 
265 270 275 Z80 

GAA GAT TCA CAG GCT TTC TGT GAG AAA TTC AGT GGA CAG CTA AAC AGC 916 
Glu Asp Ser Gin Ala Phe Cys Glu Lys Phe Ser Gly Gin Leu Asn Ser 
285 290 295 

CAT GGC TGC TTC TAT CAG CAA GTA AAA ACC AAG GTC TTC CAG CTG AAG 964 
His Gly Cys Phe Tyr Gin Gin Val Lys Thr Lys Val Phe Gin Leu Lys 
300 305 3iu 

AGG AAG GAG TAT GAA ATG AAA CTT CAC ACT. GAG GCC CAG ATC CAA GAA 1012 
Arg Lys Glu Tyr Glu Met Lys Leu His Thr Glu Ala Gin He Gin Glu 
315 320 325 

GAA GGA ACA GTG GTG GAA TTG ACT GGA AGG CAG TCC AGT GAA ATC ACA 1060 
Glu Gly Thr Val Val Glu Leu Thr Gly Arg Gin Ser Ser Glu He Thr 
330 335 340 

AGA ACC ATA ACC AAA CTC TCA TTT GTG AAA GTG GAC TCA CAC TTT CGA 1108 
SSS Vhr lie Thr Lys Leu Ser Phe Val Lys Val Asp Ser His Phe Arg 
345 350 355 360 

CAG GGA ATT CCC TTC TTT GGG CAG GTG CGC CTA GTA GAT GGG AAA GGC 1156 
Gin Gly lie Pro Phe Phe Gly Gin Val Arg Leu Val Asp Gly Lys Gly 
365 370 375 

GTC CCT ATA CCA AAT AAA GTC ATA TTC ATC AGA GGA AAT GAA GCA AAC 1204 
Val Pro He Pro Asn Lys Val lie Phe He Arg Gly Asn Glu Ala Asn 
380 385 390 

TAT TAC TCC AAT GCT ACC ACG GAT GAG CAT GGC CTT GTA CAG TTC TCT 1252 
Tyr Tyr Ser Asn Ala Thr Thr Asp Glu His Gly Leu Val Gin Phe Ser 
3 395 400 405 
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ATC AAC ACC ACC AAT GTT ATG GGT ACC TCT CTT ACT GTT AGG GTC AAT 1300 
He Asn Thr Thr Asn Val Met Gly Thr Ser Leu Thr Val Arg Val Asn 
410 415 420 

TAC AAG GAT CGT AGT CCC TGT TAC GGC TAC CAG TGG GTG TCA GAA GAA 1348 
Tyr Lys Asp Arg Ser Pro Cys Tyr Gly Tyr Gin Trp Val Ser Glu Glu 
425 430. 435 440 

CAC GAA GAG GCA CAT CAC ACT GCT TAT CTT GTG TTC TCC CCA AGC AAG 1396 
His Glu Glu Ala His His Thr Ala Tyr Leu Val Phe Ser Pro Ser Lys 
445 450 455 

AGC TTT GTC CAC CTT GAG CCC ATG TCT CAT GAA CTA CCC TGT GGC CAT 1444 
Ser Phe Val His Leu Glu Pro Met Ser His Glu Leu Pro Cys Gly His 
460 465 470 

ACT CAG ACA GTC CAG GCA CAT TAT An CTG AAT GGA GGC ACC CTG CTG 1492 
Thr Gin Thr Val Gin Ala His Tyr He Leu Asn Gly Gly Thr Leu Leu 
475 480 485 

GGG CTG AAG AAG CTC TCC TTC TAT TAT CTG ATA ATG GCA AAG GGA GGC 1540 
Gly Leu Lys Lys Leu Ser Phe Tyr Tyr Leu He Met Ala Lys Gly Gly 
490 495 500 

ATT GTC CGA ACT GGG ACT CAT GGA CTG CTT GTG AAG CAG GAA GAC ATG 1588 
lie Val Arg Thr Gly Thr His Gly Leu Leu Val Lys Gin Glu Asp Met 
505 " 510 515 520 

AAG GGC CAT TTT TCC ATC TCA ATC CCT GTG AAG TCA GAC ATT GCT CCT 1636 
Lys Gly His Phe Ser He Ser He Pro Val Lys Ser Asp He Ala Pro 
525 530 535 

GTC GCT CGG TTG CTC ATC TAT GCT GTT TTA CCT ACC GGG GAC GTG ATT 1684 
Val Ala Arg Leu Leu He Tyr Ala Val Leu Pro Thr Gly Asp Val He 
540 545 550 

GGG GAT TCT GCA AAA TAT GAT GTT GAA AAT TGT CTG GCC AAC AAG GTG 1732 
Gly Asp Ser Ala Lys Tyr Asp Val Glu Asn Cys Leu Ala Asn Lys Val 
555 560 565 

GAT TTG AGC TTC AGC CCA TCA CAA AGT CTC CCA GCC TCA CAC GCC CAC 1780 
Asp Leu Ser Phe Ser Pro Ser Gin Ser Leu Pro Ala Ser His Ala His 
570 575 580 

CTG CGA GTC ACA GCG GCT CCT CAG TCC GTC TGC GCC CTC CGT GCT GTG 1828 
Leu Arg Val Thr Ala Ala Pro Gin Ser Val Cys Ala Leu Arg Ala Val 
585 590 595 600 

GAC CAA AGC GTG CTG CTC ATG AAG CCT GAT GCT GAG CTC TCG GCG TCC 1876 
Asp Gin Ser Val Leu Leu Met Lys Pro Asp Ala Glu Leu Ser Ala Ser 
605 610 615 

TCG GTT TAC AAC CTG CTA CCA GAA AAG GAC CTC ACT GGC TTC CCT GGG 1924 
Ser Val Tyr Asn Leu Leu Pro Glu Lys Asp Leu Thr Gly Phe Pro Gly 
620 625 630 
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GCT GAG GTA 

<n„ ri.i Ual . ... . . 

765 770 

GCA GGG GCC TTC TGC CTG TCT GAA GAT GCT GGA CTT GGT ATC TCT TCC 
Ala Gly Ala Phe Cys Leu Ser Glu Asp Ala Gly Leu Gly lie Ser aer 



GTG GCT GAG GTA GGA GTA ACA GTC CCT GAC ACC ATC ACC GAG TGG AAG 
Val Ala Glu Val Gly Val Thr Val Pro Asp Thr He Thr Glu Trp Lys 



780 785 790 

apt rrr ITT CTC CGA GCC TTC CAG CCC TTC TTT GTG GAG CTC ACA ATG 
S Su £ £ ™ Ala Ke Gin Pro Phe Phe Val Glu Leu Thr Met 
795 800 003 . 

CCT TAC TCT GTG ATT CGT GGA GAG GCC TTC ACA CTC AAG GCC ACG GTC 2500 
Pro Tyr ler Sal lie Arg Gly Glu Ala Phe Thr Leu Lys Ala Thr Val 



810 815 820 

CTA AAC TAC CTT CCC AAA TGC ATC CGG GTC AGT GTG CAG CTG GAA GCC 

Si ten tJ? Eel Pro Lys Cys He Arg Val Ser Val Gin Leu Glu Ala 

825 830 835 °™ 

TCT CCC GCC TTC CTA GCT GTC CCA GTG GAG AAG GAA CAA GCG CCT CAC 

Ser Pro Ala Pne LeS Ala Val Pro Val Glu Lys Glu Gin Ala Pro His 

845 850 033 



1972 
2020 
2068 



50 

635 640 

650 65 5 660 

PAT ATG TAC AGC TTC CTA GAG GAC ATG GGC TTA AAG GCA TTC ACC AAC 
tej Set lyr Ser Se Leu Glu Asp Met Gly Leu Lys Ala Phe Thr ten 
665 670 0/3 

AAG ATT CGT 

I TIa Awi ^ |1W . „ 

685 ' 690 

GCC GGC GCC GTG GGA CAG GGA TAT TAT GGA GCC GGA CTG GGA GTG GTG 
Ala Gly Ala Val Gly Gin Gly Tyr Tyr Gly Ala Gly Leu Gly Val Val 
700 705 ,lv 

sr. % s s a £ ss SB S 5 s g as s a 2212 

715 720 '« 

AAT AAT GAG CAG AGC TCA GGA CCT GTG CCT GAG ACA GTG AGG AAG TAT 2260 
ton ten Glu Gin Ser Ser Gly Pro Val Pro Glu Thr Val Arg Lys Tyr 
730 735 740 

TTC CCT GAG ACA TGG ATC TGG GAT CTG GTG GTG GTG AAT TCC GCG GGT 
Phe Pro Glu Thr Trp He Trp Asp Leu Val Val Val Asn Ser Ala biy 



s « s ss ss £ ^ a s s 88 2S a B 3 B 21,6 



2164 



2308 



2356 



2404 



2452 



2548 



2596 
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TGC ATC TGT GCA AAC GGG CGG CAA ACT GTG TCC TGG GCA GTA ACC CCA 
Cys He Cys Ala Asn Gly Arg Gin Thr Val Ser Trp Ala Val Thr Pro 
860 865 B/u 

AAG TCA TTA GGA AAT GTG AAT TTC ACT GTG AGC GCA GAG GCA CTA GAG 
§t Ser Leu Gly Asn Val Asn Phe Thr Val Ser Ala Glu Ala Leu 61 u 
875 880 88b 

TCT CAA GAG CTG TGT GGG ACT GAG GTG CCT TCA GTT CCT GAA CAC GGA 
Ser Gin Glu Leu Cys Gly Thr Glu Val Pro Ser Val Pro Glu His Gly 
890 895 9°° 

AGG AAA GAC ACA GTC ATC AAG CCT CTG TTG GTT GAA CCT GAA GGA CTA 
Arg Lys Asp Thr Val He Lys Pro Leu Leu Val Glu Pro Glu Gly Leu 
905 910 915 MU 

GAG AAG GAA ACA ACA TTC AAC TCC CTA CTT TGT CCA TCA GGT GGT GAG 
Glu Lys Glu Thr Thr Phe Asn Ser Leu Leu Cys Pro Ser Gly Gly Glu 
g25 930 »Jb 

GTT TCT GAA GAA TTA TCC CTG AAA CTG CCA CCA AAT GTG GTA GAA GAA 
Val Ser Glu Glu Leu Ser Leu Lys Leu Pro Pro Asn Val Val Glu Glu 
940 945 9!>0 

TCT GCC CGA GCT TCT GTC TCA GTT TTG GGA GAC ATA TTA GGC TCT GCC 
Ser Ala Arg Ala Ser Val Ser Val Leu Gly Asp He Leu Gly Ser Ala 
955 960 96b 

ATG CAA AAC ACA CAA AAT CTT CTC CAG ATG CCC TAT GGC TGT GGA GAG 
Met Gin Asn Thr Gin Asn Leu Leu Gin Met Pro Tyr Gly Cys Gly Glu 
970 975 980 

CAG AAT ATG GTC CTC TTT GCT CCT AAC ATC TAT GTA CTG GAT TAT CTA 
Gin All Met Val Leu Phe Ala Pro Asn He Tyr Val Leu Asp Tyr Leu 
985 990 99b 

AAT GAA ACA CAG CAG CTT ACT CCA GAG ATC AAG TCC AAG GCC ATT GGC 
Asn Glu Thr Gin Gin Leu Thr Pro Glu He Lys Ser Lys Ala He Gly 
1005 1010 10 15 

TAT CTC AAC ACT GGT TAC CAG AGA CAG TTG AAC TAC AAA CAC TAT GAT 
]yl Leu Asn Thr Gly Tyr Gin Arg Gin Leu Asn Tyr Lys His Tyr Asp 
1020 1025 1030 

GGC TCC TAC AGC ACC TTT GGG GAG CGA TAT GGC AGG AAC CAG GGC AAC 
Gly Ser Tyr Ser Thr Phe Gly Glu Arg Tyr Gly Arg Asn Gin Gly Asn 
1035 1040 1045 

ACC TGG CTC ACA GCC TTT GTT CTG AAG ACT TTT GCC CAA GCT CGA GCC 
Thr TrJ Leu Th? Ala Phe Val Leu Lys Thr Phe Ala Gin Ala Arg Ala 
1050 1055 1060 

TAC ATC TTC ATC GAT GAA GCA CAC ATT ACC CAA GCC CTC ATA TGG CTC 
\fr He Phe He Asp Glu Ala His lie Thr Gin Ala Leu He Trp Leu 
1065 1070 1075 1080 



2644 



2692 



2740 



2788 



2836 



2884 



2932 



2980 



3028 



3076 



3124 



3172 



3220 



3268 
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1275 I 280 
1290 1295 



3316 
3364 
3412 



3460 
3508 
3556 



B SS g 2$ 5 5 S5 S 5 S S a 2 8 & 2 

1085 AU 

aaa&BSsasaaasaaB 
s< a s n° e £a s s as a c - a as a a 

1115 1120 
OCT GTT GTC CGC AAT GCC CTG TTT TGC «. W T« GCC W £ «J 
Pro Val Val Arg Asn Ala Leu Phe Cys Leu ^ 

1130 1135 
GCA CAA GAA GGG GAC CAT GGC AGO CAT GTA TAT ACC AAA GCA CTG CTG 
Ala Gin Glu Gly Asp His Gly Ser his n60 
1145 

ssEESSRsaassssaa 

1165 

CTC AAG TCA CTT AAT GAG GAA GOT GTG AAG AAA GAC AAC TOT GTC GAT 360, 
Leu Lys Ser Leu^Asn Glu Glu Ala J H90 

TGG GAG CGC CCT CAG AAA CCC AAG GCA CCA GTG GGG CAT TTT TAC GAA 
Trp Glu Arg Pro Gin Lys Pro Lys Ala pro 
1195 1200 

ss a au a e a a as a a s as a a a 

1210 121b 

a sj a s a its a a a a & a s c P a s s 

1225 1230 rrr 

asaassssagaaaas.s 

1245 

GGT HC TCC TOO ACC CA6 CAC ACA GTG GTG GCT CTC CAT GCT CTG TCC 
Gly Phe Ser Ser Thr Gin His inr »»'»*■ 12 7o 
1260 " w 

sssaas.Kassssaaass 



3652 
3700 
3748 
3796 
3844 
3892 



3940 
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a a s ss a a a % s a a a a a s | o 



3988 



1305 1310 

as a a a egs? a a a ja §s a a ga 
s a $ a a a a a a a a a a as a 

1340 1345 « au 

TTA GGA GTG CAG ACT CTG CCT CAA ACT TGT GAT GAA CCC AAA GCC CAC 
Leu Gly Val Gin Thr Leu Pro Gin Thr Cys Asp Glu Pro Lys Ala His 
1355 1360 . 

fl rr arr TTf CAA ATC TCC CTA AGT GTC AGT TAC ACA GGG AGC CGC TCT 
Thr Ser Phe Gl5 He £ S Ser Val Ser Tyr Thr Gly Ser Arg Ser 
1370 1375 13BU 

KeaBBaffiaaaaBSSiSjG 

1385 1390 1395 

CCC CTG AAG CCA ACA GTG AAA ATG CTT GAA AGA TCT AAC CAT GTG AGC 
Pro Leu Lys Pro Thr Val Lys Met Leu Glu Arg Ser Asn His Val Ser 
1405 1410 

CGG ACA GAA GTC AGC AGC AAC CAT GTC TTG ATT TAC CTT GAT AAG GTG 
Ar ?S G?5 55 Ser Ser Asn His Val Leu He Tyr Leu Asp Lys Val 
1420 I 425 

ier a a a a a a s'a a a a a a a 

1435 I* 40 
6TA AGA GAT CTG AAA CCA GCC ATA GTG AAA GTC TAT GAT TAC TAC GAG 4420 
Val Arg Asp Leu Lys Pro Ala lie Val Lys Val ijriwP 'J r 'J r ° 
1450 1455 i<l0U 

arr PAT GAG TTT GCA ATT GCT GAG TAC AAT GCT CCT TGC AGC AAA GAT 4468 
Thr Z SS S AU !le Ala Glu Tyr Asn Ala Pro Cys Ser Lys A^ 



4036 



4084 



4132 



4180 



4228 



4276 



4324 



4372 



4520 



1465 1470 1475 

CTT GGA AAT GCT TGAAGACCAC AAGGCTGAAA AGTGCTTTGC TGGAGTCCTG 
Leu Gly Asn Ala 

TTCTCTGAGC TCCACAGAAG ACACGTGTTT TTGTATCTTT AAAGACTTGA TGAATAAACA 4580 

4599 

CTTTTTCTGG TCAAAAAAA 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1484 amino acids 

(B) TYPE: amino acid 
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{M TOPOLOGY: linear 
(E) FEATURES: bait region: 690-740 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

Met Gly Lys Asn Lys Leu Leu His Pro Ser Leu Val Leu Leu Leu Leu 



Val Leu Leu Pro Thr Asp Ala Ser Val Ser Gly Lys Pro Gin Tyr Met 

20 25 
Val Leu Val Pro Ser Leu lee His Thr n. Thr Thr Glu Lys Gly Cys 

35 *° 
Val Leu Leu Ser Tyr Leu As; Glu Thr Val Thr Val Ser Ma Ser Leu 

50 55 
«. Ser Val Ar, Sly Asn Ar, Ser Leu Phe Thr Asp Leu 61. Ala Glu 
65 70 

A,n Asp V.1 Leu His Cys Val Ala Phe Ala Val Pro Lys Ser Ser Ser 
M, Glu Glu Val Met Phe Leu Thr Val Glo Val lys Gly Pro Thr Gin 



100 105 



Glu Phe Lys lys Ar9 Thr Thr V.1 let Val Lys Asn Glu Asp Ser Leu 



115 



M Phe Val Gin Thr Asp lys Ser He Tyr Lys Pro Gly Gin Thr Val 



130 135 
L,s Phe ft, Val Val Ser Met Asp Glu Asn Phe His Pro Leu Asn Glu 

Leu lie Pro Leu W Tyr He Gin Asp Pro Lys Gly Asn Arg II. Ala 

II. Trp Gin Ser Phe Gin Leu Glu Gly Gly Leu Lys Gin m Ser Phe 

180 103 
Pr . Leu Ser Ser Glu Pro Phe Gin Gl, Ser Tyr Lys Val Val V.1 Gin 

195 200 
Lys Lys Ser Gly Gl, Ar, Thr Glu His Pro Phe Thr Val Glu Glu Phe 

210 215 
Val Leu Pro Lys Phe Glu Val Gin Val Thr Val Pro Lys He He Thr 
225 230 

He Leu Glu Glu Glu Met Asn Val Ser Val Cys Gly Leu Tyr Thr Tyr 



245 250 

260 



Gly Lys Pro Val Pro Gly His Val Thr Val Ser He Cys Ar, Lys Tyr 
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Ser Asp Ala Ser Asp Cys His Gly Glu Asp Ser Gin Ala Phe Cys Glu 

275 280 
Lys Phe Ser Gly Gin Leu Asn Ser His Gly Cys Phe Tyr Gin Gin Val 

290 295 
Lys Thr Lys Val Phe Gin Leu Lys Arg Lys Glu Tyr Glu Met Lys Leu 
305 310 31!> 

His Thr Glu Ala Gin He Gin Glu Glu Gly Thr Val Val Glu Leu Thr 

325 330 
Gly Arg Gin Ser Ser Glu He Thr Arg Thr He Thr Lys Leu Ser Phe 



340 345 350 

._. Asp 

355 360 



Val Lys Val Asp Ser His Phe Arg Gin Gly He Pro Phe Phe Gly Gin 



Val Arg Leu Val Asp Gly Lys Gly Val Pro He Pro Asn Lys Val He 

370 375 380 

Phe lie Arg Gly Asn Glu Ala Asn Tyr Tyr Ser Asn Ala Thr Thr Asp 
385 390 3ya 

Glu His Gly Leu Val Gin Phe Ser He Asn Thr Thr Asn Val Met Gly 
405 410 

Thr Ser Leu Thr Val Arg Val Asn Tyr Lys Asp Arg Ser Pro Cys Tyr 
420 425 

Gly Tyr Gin Trp Val Ser Glu Glu His Glu Glu Ala His His Thr Ala 
435 440 ^ 

Tyr Leu Val Phe Ser Pro Ser Lys Ser Phe Val His Leu Glu Pro Met 
450 

Ser His Glu Leu Pro Cys Gly His Thr Gin Thr Val Gin Ala His Tyr 

465 470 4,73 

He Leu Asn Gly Gly Thr Leu Leu Gly Leu Lys Lys Leu Ser Phe Tyr 

Tyr Leu He Met Ala Lys Gly Gly lie Val Arg Thr Gly Thr His Gly 
500 505 

Leu Leu Val Lys Gin Glu Asp Met Lys Gly His Phe Ser He Ser He 

515 520 
Pro Val Lys Ser Asp He Ala Pro Val Ala Arg Leu Leu He Tyr Ala 

530 535 MU 

Val Leu Pro Thr Gly Asp Val He Gly Asp Ser Ala Lys Tyr Asp Val 
545 550 

Glu Asn Cys Leu Ala Asn Lys Val Asp Leu Ser Phe Ser Pro Ser Gin 
565 570 
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Ser L e« Pro Ala Ser His Ala «« jg *1 « * Pr ° 61 " 

580 58S 
Ser „, Cys Ala Leu Ar, Ala Va, Asp Gin Ser Val Leu Leu Het Lys 

Pro Asp Z II. Ser «. Ser Ser Val Tyr As; Leu Leu Pro (I. 

610 615 
^ Asp Leu Thr By Phe Pro Gly P™ 1- Asn Asp «. Asp Asp Glu 
625 630 

tep cys II. Aso Are His Asn Val Tyr lie Asn Gly He Thr Tyr Thr 

645 

Pr „ va, Ser Ser Thr Asn Glu Lys Asp Het Tyr Ser Phe Leu G,u Asp 

660 665 
Het G„ Leu Lys Ala Phe Thr Asn Ser Lys He Are Lys Pro Lys Het 
675 

C ys Pro Gin Leu n. Ser Va, Ser Ala G,y Ala Val Gl, Gin Gly Tyr 
690 695 



Tyr 61, Ala Gl, Leu Gl, Val Va, «. Arg Pro T,r V„ Pro «. Leu 
705 710 

Gl, Thr T»r Asn Val He Pro Leu Asn Asn Glu Gin Ser Ser Gly Pro 
725 

V„ Pro Glu Thr Val Arg Lys Tyr Phe Pro Glu Thr Trp He Trp Asp 

740 /HQ 
L eu Va, V.1 Val Asn Ser Ala Gly Val Ala Glu Val Gly Val Thr Val 

755 /bU 
Pro Asp Thr II. Thr G,u Trp Lys Ala Gly Ala Phe Cys Leu Ser Glu 

770 775 
Asp A,. Gl, Leu G„ lie Ser Ser Thr Ala Ser Leu Arg «,» Phe Gin 
785 790 

Pro Ph. Ph. Val Glu Leu Thr Met Pro Tyr Ser Val He Arg GTy Glu 

805 olu 
Ala Phe Thr Leu Lys Ala Thr Val Leu Asn Tyr Leu Pro Lys Cys 11. 

820 ^ 
srg ,al Ser Vol Gin Leu Glu Ala Ser Pro Ala Phe Leu Ala Val Pro 
835 



,a, G,u Lys G,u G,n Ala Pro His Cys He Cys AT. Asn G,» Arg GIn 
850 855 



Tnr Va, ser Trp Al. Val Thr Pro Lys Ser Uu Gly Asn Val Asn Phe 
865 870 
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Thr Val Ser Ala Glu Ala Leu Glu Ser Gin Glu Leu Cys Gly Thr Glu 
885 890 

Val Pro Ser Val Pro Glu His Gly Arg Lys Asp Thr Val lie Lys Pro 
900 905 

Leu Leu Val Glu Pro Glu Gly Leu Glu Lys Glu Thr Thr Phe Asn Ser 
915 9 20 9 " 

Leu Leu Cys Pro Ser Gly Gly.Glu Val Ser Glu Glu Leu Ser Leu Lys 
930 9 35 940 

Leu Pro Pro Asn Val Val Glu Glu Ser Ala Arg Ala Ser Val Ser Val 
945 950 955 

Leu Gly Asp He Leu Gly Ser Ala Met Gin Asn Thr Gin Asn Leu Leu 
965 9 '° 3/0 

Gin Met Pro Tyr Gly Cys Gly Glu Gin Asn Met Val Leu Phe Ala Pro 

980 985 
Asn lie Tyr Val Leu Asp Tyr Leu Asn Glu Thr Gin Gin Leu Thr Pro 

gg5 1000 XVUD 

Glu lie Lys Ser Lys Ala He Gly Tyr Leu Asn Thr Gly Tyr Gin Arg 
1010 1015 1020 

GlnUu Asn Tyr Lys His^Tyr Asp Gly Ser Tyr^Ser Thr Phe Gly GlU Q 

Arg Tyr Gly Arg Asn^Gln Gly Asn Thr Trp^Leu Thr Ala Phe Val^Leu 

Lys Thr Phe Ala Gin Ala Arg Ala Tyr lie Phe He Asp Glu Ala His 
1060 1065 1 

lie Thr Gin Ala Leu He Trp Leu Ser Gin Arg Gin Lys Asp Asn Gly 
1075 1080 

Cys Phe Arg Ser Ser Gly Ser Leu Leu Asn Asn Ala He Lys Gly Gly 

1090 IO 95 1100 

Val Glu Asp Glu Val Thr Leu Ser Ala Tyr lie Thr He Ala Leu Le^ 
1105 I 110 

Glu lie Pro Leu Thr Val Thr His Pro Val Val Arg Asn Ala Leu Phe 
1125 1130 

Cys Leu Glu Ser Ala Trp Lys Thr Ala Gin Glu Gly Asp His Gly Ser 
U40 1145 1AW 

His Val Tyr Thr Lys Ala Leu Leu Ala Tyr Ala Phe Ala Leu Ala Gly 
1155 H 60 11 

Asn Gin Asp Lys Arg Lys Glu Val Leu Lys Ser Leu Asn Glu Glu Ala 
1170 II 75 11 
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Lys Lys Asp Asn Ser Val His Trp II. Ar^Pro Gin I*. Jg, 
1185 1190 

Ma Pro V.l Gly H|S 5 Phe Tyr Glu NOB. ft. Ser AU GUV.T 

n. Met Thr Ser Tyr V,! Leu Leu Ala Tyr Leu Thr Ala GUPr. AU 

1220 1ZZ!> 
„, Thr Ser Glu Asp Leu Thr Ser Ala Thr Asn He WUor. Trp He 

1235 1240 
Tnr L ys «. Gin Asn Ala «, , Gl, «y * Ser Se^Thr Gln His Thr 

1250 1255 
M Val Ala Leu His Ala Leu Ser Lys Tyr HjM. Ala Thr Phe » 
1265 1270 

Arg Thr Gly Lys AU AU Gin Val Thr IU Gin Ser Ser Gly Throne 

Ser Ser Lys Phe Gin Val Asp Asn Asn Asn Arg Leu Leu LeuO. Gin 

1300 130S 
M ser Leu Pre Glu Leu Pro Gly Glu Tyr Ser Met lg*l Thr Gly 

1315 1320 
CI. Gly cys Val Tyr Leu Gin Thr Ser Leu Lys Tyr Asn lie Leu Pro 

1330 1335 
D. Lys Glu Glu Phe Pro Phe Ala Leu Gly Val Gin Thr Leu Pro GU 
1345 1350 

Thr Cys Asp Glu Pro^Lys Ala His Thr SerNe Gin lie Ser Lender 

Val Ser Tyr Thr Gl, Ser Arg Ser AU Ser Asn Met AU jleV.l Asp 

1380 138i) 
,., L,s Het.V.l Ser Gly Phe lUPr. Leu Lys Pro » W Lys Het 



1395 



. « mc u a i Ser Ara Thr Glu Val Ser Ser Asn His 
Leu Glu Arg Ser Asn His Val Ser Arg mr u ^ 

1410 1,15 
M Leu lie Tyr Leu Aso Lys Val Ser Asn Gin Thr Leu Ser Leu Pta 
1425 1430 

Phe Thr V.1 Leu Gin Asp Val Pro Val Arg Asp Leu Lys Pro AU II. 



1445 



Val L,s V.! Tyr.Asp Tyr Tyr Glu Thrtop Glu Phe Ala JUAU Glu 



1460 



Tyr Asn Ala Pro Cys Ser Lys AipUu Gly Asn Ala 



1475 
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PATENT CLAIMS 

1 A process for the production of recombinant ot-macroglobulln, 

variants, fragments or derivatives thereof, wherein a functionally operative 
expression vector comprising a gene encoding for the expression of o- 
5 macroglobulin, variants, fragments or derivatives thereof, or alleles of 
such a gene, is introduced into a suitable host capable of expressing said 
gene said host is cultured in a suitable nutrient medium containing sources 
of assimilable carbon and nitrogen and other essential nutrients, and the 
expressed a-macroglobulin or fragments or derivatives thereof is recovered. 



10 



2 The process of claim 1, wherein said gene encodes for the 

expression of human a 2 -macroglobulin, variants, fragments or derivatives 
thereof. 

15 3 The process of claim 2, wherein said gene encodes for the 

expression of human ^-macroglobulin having the amino acid sequence of SEQ 
ID N0:2, or a fragment or derivative thereof. 

4. The process of claim 2 or 3, wherein said gene comprises the DNA 

20 sequence of SEQ ID NO:l, or a fragment thereof. 

5 The process of claim 1 or 2, wherein said gene encodes for a 

variant a-macroglobulin, in which the amino acid sequence of the bait region 
has been altered. 



25 



6. The process of claim 5, wherein the bait region has been altered 
by incorporation of further proteinase target sites. 

7. The process of claim 5, wherein the bait region has been altered 
30 by removal of proteinase target sites. 

8 The process of claim 5, wherein the bait region has been altered 

by replacing one or more specific proteinase target sites with one or more 
other specific proteinase target sites. 

35 

g. The process of claim 8, wherein said proteinase target sites are 

specific for bovine trypsin, Strpptomvces griseus trypsin, papain, porcine 
elastase, bovine chyroosin, bovine chymotrypsin, Sta phylococcus aureus strain 
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»8 proteinase, _ plas-in, bovine thrown, thermolysin. subtllisin Move , 
aniti-.tr arffltumas °riseus proteinase B. . 

,0 The process of claim 5, wherein wherein the bait region has been 

5 "iered by repUcin, said bait region or part thereof with a bait reg,on or 
a part thereof fro» another e-iaacroglobulin. 

„ The process of claim 10, wherein said bait regions originate fro. 

L Pregnane, Zone Protein ( PZP,, rat * rat rat o,L vanant 
10 1, or rat .,1, .orient 2 CL - o,-1nhibitor 3), espeaally PZP. 

l2 The process of any of claims 5 to 11, wherein said gene encodes 

for the expression of honan a ^acrogiobuiin variant having the am,no acd 
sequence of SEQ ID HO:*, or a fragment or derivative thereof. 

15 13 The process of any of claims 5 to 12, wherein said gene comprises 

the ONA sequence of SEQ 10 H0:3, or a fragment thereof. 

14 . The process of any of the claims 1 to 13, -herein said gene is 

20 a synthetic gene* 

15 The process of any of the claims 1 to 14, wherein said «- 

LoglobuUn. variant, fragment or derivative thereof is co-expressed w,th 
a desired gene product. 
25 16 . The process of any of the claims 1 to 15, wherein said gene is, 

or is derived from, a human gene. 

17 The process of any of the claims 1 to 16, wherein said host is 

30 a bacterial strain, a fungal strain, a mammalian cell line, or a mammal. 

18. The process of claim 17, wherein said host is a fungus. 

19 . The process of claim 18, wherein said fungus belongs to the genus 
35 /Vsppraillus . 

20 The process of claim 18, wherein said host is a yeast. 
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21. The process of claim 20, wherein said yeast belongs to the genus 
Saccharomvces . 

22. The process of claim 17, wherein said host is a mammalian cell 
5 line. 

23. The process of claim 22, wherein said mammalian cell line is a 
Syrian Baby Hamster Kidney (BKH) cell line. 

10 24. The process of claim 23, wherein said cell line is available from 

ATCC under No. CRL 1632. 

25. A DNA sequence comprising a gene encoding for the expression of 

an a-macroglobulin, variants, fragments or derivatives thereof. 

15 26 The DNA sequence of claim 25, wherein said gene encodes for human 

c^-macroglobulin. 

27. The DNA sequence of claim 25, wherein said gene encodes for the amino 
20 acid sequence of SEQ ID N0:2 or a fragment or derivative thereof. 

28. The DNA sequence of claim 26 or 27, wherein said gene has the 
nucleotide sequence of SEQ ID N0:1 or a fragment thereof. 

25 2g. The DNA sequence of claim 25 or 26, wherein said gene encodes 

for a variant 0 -macroglobulin, in which the amino acid sequence of the bait 
region has been altered. 

30. The DNA sequence of claim 29, wherein said bait region has been 
30 altered by incorporation of further proteinase target sites. 

31. The DNA sequence of claim 29, wherein said bait region has been 
altered by removal of proteinase target sites. 

35 32 The DNA sequence of claim 29, wherein said bait region has been 

altered by replacing one or more specific proteinase target sites with one 
or more other specific proteinase target sites. 
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33 The DNA sequence of claim 29, wherein, wherein said proteinase 
taraet sites are specific for bovine trypsin, Streptomvces qpseu s trypsin, 

a , orpine elLtase, bovine chymosin, bovine chymotrypsin / 
LLaurls strain V8 proteinase, human plasmin, bovine thrombin, thermoly- 
5 sin, subtilisin Novo and/or Strpptomyces griseus proteinase B. 

34 The DNA sequence of claim 29, wherein the bait region has been 
aliered by replacing said bait region or part thereof with a bait region or 
a part thereof from another cr-macroglobulin. 

10 35 The DNA sequence of claim 34, wherein said bait region originates 

from, human Pregnancy Zone Protein (P2P), rat a,M, rat <gt. rat «,!, 
variant 1, or rat aj, variant 2, especially PZP. 

15 36 A functionally operative expression vector comprising a gene in 

accordance with any of the claims 25 to 35 for the expression of human ^ 
m acroglobulin, variants, fragments or derivatives thereof, or alleles of 
such a gene. 

20 37 The vector of claim 36, further comprising regulatory elements 

necessary for the stable maintenance of said vector in mammalian cells. 

38 The vector of claim 36 or 37, further comprising sequences 

providing for the processing and secretion of the expressed product. 

" 39 . The vector of any of the claims 36 to 38, further comprising one 

or more other genes encoding' for a desired gene product. 

40 A functionally operative expression vector comprising a gene 
30 encoding for the expression of an a-macroglobulin, variants, ^» «* 

derivatives thereof, or alleles of such a gene, essentially as described. 

41 A transformed host comprising a functionally operative expression 
vector comprising a gene encoding for the expression of human c-macro- 

35 globulin or fragments or derivatives thereof, or alleles of such a gene. 

42. The host of claim 41, wherein said vector is the vector of any 

of the claims 36 to 40. 
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43. The host of claim 41 or 42, wherein said host is a bacterial 
strain, a fungal strain, a mammalian cell line, or a mammal. 

44. • The host of claim 43, wherein said host is a fungus. 

5 

45. The host of claim 44, wherein said fungus belongs to the genus 
Aspergillus. 

46. The host of claim 44, wherein said host is a yeast. 

10 

47. The host of claim 46, wherein said host belongs to the genus Sac^ 
charomvces . 

48. The host of claim 43, wherein said host is a mammalian cell line. 

15 

49. The host of claim 48, wherein said host is a Syrian Baby Hamster 
Kidney (BHK) cell line. 

50. The host of claim 49, wherein said cell line is available from 
20 ATCC under No. CRL 1632. 

51. Recombinant human 0,-macroglobulin of SEQ ID N0:2 or SEQ ID N0:4 
in an active form. 

25 52. Recombinant a-macroglobulin, variants, fragments or derivatives 

thereof produced by a process of any of the claims 1 to 24. 

53. Recombinant a-macroglobulin, variants, fragments or derivatives 
thereof of claim 52 produced by the use of a vector of any of the claims 36 

30 to 40. 

54. Recombinant a-macroglobulin, variants, fragments or derivatives 
thereof essentially as described. 

35 55. Recombinant human a 2 -macrogl obul in, variants, fragments or 

derivatives thereof essentially as described. 

56. A growth medium comprising one or more a-macrogl obul ins. 
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57 A grovrth medium comprising recombinant o-macroglobulin, variants, 
fragments or derivatives thereof according to any of the claims 51 to 55. 

58 Use of recombinant o-macroglobulin, variants, fragments or 
5 derivatives thereof according to any of the claims 51 to 55 as a protein 

carrier in enzyme replacement therapy. 

59 Use of recombinant a-macroglobulin, variants, fragments or 
derivatives thereof according to any of the claims 51 to 55 as a DNA carrier 

10 in gene therapy. 
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